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SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION: 
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(i) APPLICANT: 

(A) NAME: COMMONWEALTH SCIENTIFIC AND INDUSTRIAL 
RESEARCH ORGANISATION 

(B) STREET: Limestone Avenue 

(C) CITY: Campbell 

(D) STATE: ACT 

(E) COUNTRY: AUSTRALIA 

(F) POSTAL CODE (ZIP): 2612 

(A) NAME: THE AUSTRALIAN NATIONAL UNIVERSITY 

(B) STREET: BRIAN LEWIS CRESCENT 

(C) CITY: ACTON 

(D) STATE: ACT 

(E) COUNTRY: AUSTRALIA 

(F) POSTAL CODE (ZIP): 2601 

(A) NAME: GOODMAN FIELDER LIMITED 

(B) STREET: LEVEL 42, GROSVENOR PLACE 

(C) CITY: SYDNEY 

(D) STATE: NSW 

(E) COUNTRY: AUSTRALIA 

(F) POSTAL CODE (ZIP): 2000 




(A) NAME: GROUPE LIMAGRAIN PACIFIC PTY LIMITED 

(B) STREET: LEVEL 3 1 f 1 O'CONNELI/STREET 

(C) CITY: SYDNEY 

(D) STATE: NSW 

(E) COUNTRY: AUSTRALIA 

(F) POSTAL CODE (ZIP): 2000 

(ii) TITLE OF INVENTION: REGULATION OF GENE EXPRESSION IN PLANTS 

(iii) NUMBER OF SEQUENCE-^ 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE^oppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS-DOS 

(D) SOFTWARE/Patentln Release #1 .0, Version #1 .30 (EPO) 

(2) INFORMATIObTTOR SEQ ID NO: 1: 

(i) SEQUENCB^CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYRE: nucleic acid 

(C) STR ANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "per primer based on the N-tcrminai sequence of wSBE I 5 ' .end at 

positi/n 168 of SEQ ID NO:5" 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: triticum lauschii 
(F) TISSUE TYPE: Endosperm 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

10 

GGC ACGCG AG AG ACTGG \ 7 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 
1 5 (A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "per primer in which 5 ■ end is at position 1590 of SEQ ID NO:5" 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: triticum tauschii 

(F) TISSUE TYPE; Endosperm 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

3 5 TACATTTCCT TGTCCATCA 1 9 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

4 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

4 5 (A) DESCRIPTION: /dese = "per primer 5 ' end is at position 1 of SEQ ID NO:5" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 

50 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: triticum tauschii 

5 5 (F) TISSUE TYPE: Endosperm 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCACG AG AG CTTGCTCA 1 8 

5 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STR ANDEDNES S : single 
1 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "per primer 5 ' end is at position 334 of SEQ ID NO:5" 

1 5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGGTACACAG TTGCGTCATT TTC 23 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

40 (iv) ANTI-SENSE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATCGACGAAG 


ATGCTCTGCC 


TCACCGCCCC 


CTCCTGCTCG 


CCATCTCTCC 


CGCCGCGCCC 


60 


CTCCCGTCCC 


GCTGCTGACC 


GGCCCGGACC 


GGGGATTTCG 


GCCAAGAGCA 


AGTTCTCTGT 


120 


TCCCGTGTCT 


GCGCCAAGAG 


ACTACACCAT 


GGCAACAGCT 


GAAGATGGTG 


TTGGCGACCT 


180 


TCCGATATAC 


GATCTGGATC 


CGAAGTTTGC 


CGGCTTCAAG 


GAACACTTCA 


GTTATAGGAT 


240 


GAAAAAGTAC 


CTTGACCAGA 


AACATTCGAT 


TGAGAAGCAC 


GAGGGAGGCC 


TTGAAGAGTT 


300 


CTCTAAAGGC 


TATTTGAAGT 


TTGGGATCAA 


CACAGAAAAT 


GACGCAACTG 


TGTACCGGGA 


360 



20 



30 



35 
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ATGGGCCCCT GCAGCAATGG ATGCACAACT TATTGGTGAC TTCAACAACT GGAATGGCTC 420 
s TGGGCACAGG ATGACAAAGG ATAATTATGG TGTTTGGTCA ATCAGGATTT CCCATGTCAA 4 80 
TGGGAAACCT GCCATCCCCC ATAATTCCAA GGTTAAATTT CGATTTCACC GTGGAGATGG 540 
ACTATGGGTC GATCGGGTTC CTGCATGGAT TCGTTATGCA ACTTTTGACG CCTCTAAATT 600 
10 TGGAGCTCCA TATGACGGTG TTCACTGGGA TCCACCTTCT GGTGAAAGGT ATGTGTTTAA 660 
GCATCCTCGG CCTCGAAAGC CTGACGCTCC ACGTATTTAC GAGGCTCATG TGGGGATGAG 720 
i5 TGGTGAGAGG CCTGAAGTAA GCACATACAG AGAATTTGCA GACAATGTGT TACCGCGCAT 7 80 
AAAGGCAAAC AACTACAACA CAGTTCAGCT GATGGCAATC ATGGAACATT CCATATTATG 840 
CTTCTTTTGG TACCATGTGA CGAATTTCTT CGCAGTTAGC AGCAGATCAG GAACACCAGA 900 

2 0 GGACCTCAAA TATCTTGTTG ACAAGGCACA TAGCTTAGGG TTGCGTGTTC TGATGGATGT 960 

TGTCCATAGC CATGCGAGCA GTAATATGAC AGATGGTCTA AATGGCTATG ATGTTGGACA 1020 
25 AAACACACAG GAGTCCTATT TCCATACAGG AG AAAGGGG T TATCATAAAC TGTGGGATAG 1080 
TCGCCTGTTC AACTATGCCA ATTGGGAGGT CTTACGGTAT CTTCTTTCTA ATCTGAGATA 1140 
TTGGATGGAC GAATTCATGT TTGACGGCTT CCGATTTGAT GGAGTAACAT CCATGCTATA 1200 

3 0 TAATCACCAT GGTATCAATA TGTCATTCGC TGGAAATTAC AAGGAATATT TTGGTTTGGA 12 60 

TACCGATGTA GATGCAGTTG TTTACATGAT GCTTGCGAAC CATTTAATGC ACAAAATCTT 1320 
35 GCCAGAAGCA ACTGTTGTTG CAGAAGATGT TTCAGGCATG CCAGTGCTTT GTCGGTCAGT 13 80 
TGATGAAGGT GGAGTAGGGT TTGACTATCG CCTTGCTATG GCTATTCCTG ATAGATGGAT 1440 
TGACTACTTG AAGAACAAAG ATGACCTTGA ATGGTCAATG AGTGCAATAG CACATACTCT 1500 
GACCAACAGG AGATATACGG AAAAGTGCAT TGCATATGCT GAGAGCCACG ATCAGTCTAT 1560 
TGTTGGCGAC AAGACTATGG CATTTCTCTT GATGGACAAG GAAATGTATA CTGGCATGTC 1620 
45 AGACTTGCAG CCTGCTTCAC CTACAATTGA TCGTGGAATT G C ACTTC AAA AGATGATTCA 1680 
CTTCATCACC ATGGCCCTTG GAGGTGATGG CTACTTGAAT TTTATGGGTA ATGAGTTTGG 1740 
CCACCCAGAA TGGATTGACT TTCCAAGAGA AGGCAACAAC TGGAGTTATG ATAAATGCAG 1800 
ACGCCAGTGG AGCCTCTCAG ACATTGATCA CCTACGATAC AAGTACATGA ACGCATTTGA 1860 
TCAAGCAATG AATGCGCTCG ACGACAAGTT TTCCTTCCTA TCGTCATCAA AGCAGATTGT 192 0 
55 CAGCGACATG AATGAGGAAA AGAAGATTAT TGTATTTGAA CGTGGAGATC TGGTCTTCGT 1980 
CTTCAATTTT CATCCCAGTA AAACTTATGA TGGTTACAAA GTCGGATGTG ATTTGCCTGG 2040 
GAAGTACAAG GTAGCTCTGG ACTCCGATGC TCTGATGTTT GGTGGACATG GAAGAGTGGC 2100 
CCAGTACAAC GATCACTTCA CGTCACCTGA AGGAGTACCA GGAGTACCTG AAACAAACTT 2160 
CAACAACCGC CCTAATTCAT TCAAAGTCCT GTCTCCACCC CGCACTTGTG TGGCTTACTA 2220 
TCGCGTCGAG GAAAAAGCGG AAAAGCCTAA GGATGAAGGA GCTGCTTCTT GGGGCAAAGC 22 80 
TGCTCCTGGG TACATCGATG TTGAAGCCAC TCGTGTCAAA GACGCAGCAG ATGGTGAGGC 2 3 40 
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GACTTCTGGT TCCAAAAAGG CGTCTACAGG AGGTGACTCC AGCAAGAAGG GAATTAACTT 2 4 00 

TGTCTTCGGG TCACCTGACA AAGATAACAA ATAAGCACCA TATCAACGCT TGATCAGAAC 2 460 

5 CGTGTACCGA CGTCCTTGTA ATATTCCTGC TATTGCTAGT AGTAGCAATA CTGTCAAACT 2520 

GTGCAGACTT GAGATTCTGG CTTGGACTTT GCTGAGGTTA CCTACTATAT AGAAAGATAA 2 580 

ATAAGAGGTG ATGGTGCGGG TCGAGTCCGG CTATATGTGC CAAATATGCG CCATCCCGAG 264 0 
TCCTCTGTCA TAAAGGAAGT TTCGGGCTTT CAGCCCAGAA TAAAAAA 2 6 87 



10 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 
1 5 (A) LENGTH: 807 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 

25 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

3 0 (ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: I. .807 

(D) OTHER INFORMATION :/Iabe I = sbel 
/note= "deduced amino acid sequence from SEQ ID NO:5 M 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Cys Leu Thr Ala Pro Ser Cys Ser Pro Ser Leu Pro Pro Arg 
1 5 10 15 

40 

Pro Ser Arg Pro Ala Ala Asp Arg Pro Gly Pro Gly lie Ser Ala Lys 
20 25 30 

Ser Lys Phe Ser Val Pro Val Ser Ala Pro Arg Asp Tyr Thr Met Ala 
45 35 40 45 



50 



55 



Thr Ala Glu Asp Gly Val Gly Asp Leu Pro lie Tyr Asp Leu Asp Pro 
50 55 60 

Lys Phe Ala Gly Phe Lys Glu His Phe Ser Tyr Arg Met Lys Lys Tyr 
65 70 75 80 

Leu Asp Gin Lys His Ser lie Glu Lys His Glu Gly Gly Leu Glu Glu 
85 90 95 

Phe Ser Lys Gly Tyr Leu Lys Phe Gly lie Asn Thr Glu Asn Asp Ala 
100 105 HO 



Thr Val Tyr Arg Glu Trp Ala Pro Ala Ala Met Asp Ala Gin Leu lie 
60 115 120 125 
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Gly Asp Phe Asn Asn Tr? Asn Gly Ser Gly His Arg Met Thr Lys Asp 
13u 135 140 

Asn Tyr Gly Val Trp Ser He Arg He Ser His Val Asn Gly Lys Pro 
145 -50 155 i 6 o 

Ala He Pro His Asn Ser Lys Val Lys Phe Arg Phe His Arg Gly Asp 
165 170 175 

Gly Leu Trp Val Asp Arg Val Pro Ala Trp He Arg Tyr Ala Thr Phe 
180 185 i 9 o 

Asp Ala Ser Lys Phe Gly Ala Pro Tyr Asp Gly Val His Trp Asp Pro 
195 200 205 

Pr ° ?f I Gly Glu Arg ^ Val Phe L ^ s Kis Pro Ar 3 Pro. Arg Lys Pro 
21° 215 220 

OH ^? A1S Pr ° Ars Ile T * r Glu Ala His Val G ly Met Ser Gly Glu Arg 

20 225 230 235 240 

Pro Glu Val Ser Thr Tyr Arg Glu Phe Ala Asp Asn Val Leu Pro Arg 
245 250 255 

He Lys Ala Asn Asn Tyr Asn Thr Val Gin Leu Met Ala He Met Glu 
260 265 270 

His Ser Ile Leu Cys Phe Phe Trp Tyr His Val Thr Asn Phe Phe Ala 
275 280 285 

Val Ser Ser Arg Ser Gly Thr Pro Glu Asp Leu Lys Tyr Leu Val Asp 
290 295 300 



30 



40 



45 



,r Ly ? Ala His Ser Le u Gly Leu Arg Val Leu Met Asp Val Val His Ser 

35 305 310 315 320 

His Ala Ser Ser Asn Met Thr Asp Gly Leu Asn Gly Tyr Asp Val Gly 
325 330 335 

Gin Asn Thr Gin Glu Ser Tyr Phe His Thr Gly Glu Arg Gly Tyr His 
340 345 350 

Lys Leu Trp Asp Ser Arg Leu Phe Asn Tyr Ala Asn Trp Glu Val Leu 
355 360 3 6 5 

Arg Tyr Leu Leu Ser Asn Leu Arg Tyr Trp Met Asp Glu Phe Met Phe 
370 375 380 

*n A f? Gly PhS ArS Phe Asp Val Thr Ser Met L eu Tyr Asn His His 

b ° 385 3 90 395 400 

Gly He Asn Met Ser Phe Ala Gly Asn Tyr Lys Glu Tyr Phe Gly Leu 
405 410 415 

Asp Thr Asp Val Asp Ala Val Val Tyr Met Met Leu Ala Asn His Leu 
420 425 430 

Met His Lys He Leu Pro Glu Ala Thr Val Val Ala Glu Asp Val Ser 
43 5 440 445 

Gly Met Pro Val Leu Cys Arg Ser Val Asp Glu Gly Gly Val Gly Phe 
450 455 460 
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Asp Tyr Arg Leu Ala Met Ala lie Pro Asp Arg Trp lie Asp Tyr Leu 

465 470 475 480 

Lys Asn Lys Asp Asp Leu Glu Trp Ser Met Ser Ala lie Ala His Thr 

5 485 490 495 

Leu Thr Asn Arg Arg Tyr Thr Glu Lys Cys lie Ala Tyr Ala Glu Ser 

500 505 510 

10 His Asp Gin Ser lie Val Gly Asp Lys Thr Met Ala Phe Leu Leu Met 

515 520 525 



15 



30 



45 



60 



Asp Lys Glu Met Tyr Thr Gly Met Ser Asp Leu Gin Pro Ala Ser Pro 

530 535 540 

Thr lie Asp Arg Gly He Ala Leu Gin Lys Met He His Phe He Thr 
545 550 555 560 



Met Ala Leu Gly Gly Asp Gly Tyr Leu Asn Phe Met Gly Asn Glu Phe 
20 565 570 575 

Gly His Pro Glu Trp lie Asp Phe Pro Arg Glu Gly Asn Asn Trp Ser 
580 585 590 

2 5 Tyr Asp Lys Cys Arg Arg Gin Trp Ser Leu S.er Asp He Asp His Leu 

595 600 605 



Arg Tyr Lys Tyr Met Asn Ala Phe Asp Gin Ala Met Asn Ala Leu Asp 

610 615 620 

Asp Lys Phe Ser Phe Leu Ser Ser Ser Lys Gin He Val Ser Asp Met 

625 630 635 640 



Asn Glu Glu Lys Lys He He Val Phe Glu Arg Gly Asp Leu Val Phe 
35 645 650 655 

Val Phe Asn Phe His Pro Ser Lys Thr Tyr Asp Gly Tyr Lys Val Gly 

660 665 670 

4 0 Cys Asp Leu Pro Gly Lys Tyr Lys Val Ala Leu Asp Ser Asp Ala Leu 

675 680 685 



Met Phe Gly Gly His Gly Arg Val Ala Gin Tyr Asn Asp His Phe Thr 

690 695 700 

Ser Pro Glu Gly Val Pro Gly Val Pro Glu Thr Asn Phe Asn Asn Arg 

705 710 715 720 



Pro Asn Ser Phe Lys Val Leu Ser Pro Pro Arg Thr Cys Val Ala Tyr 
50 725 730 735 

Tyr Arg Val Glu Glu Lys Ala Glu Lys Pro Lys Asp Glu Gly Ala Ala 
740 745 750 

55 Ser Trp Gly Lys Ala Ala Pro Gly Tyr He Asp Val Glu Ala Thr Arg 

755 760 765 

Val Lys Asp Ala Ala Asp Gly Glu Ala Thr Ser Gly Ser Lys Lys Ala 
770 775 780 



Ser Thr Gly Gly Asp Ser Ser Lys Lys Gly He Asn Phe Val Phe Gly 
785 790 795 800 
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Ser Pro Asp Lys Asp Asn Lys 
805 

(2) INFORMATION FOR SEQ ID NO: 7: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: iriticum tauschii 
(F) TISSUE TYPE: Endosperm 

20 

(ix) FEATURE: 

(A) NAME/KEY: misc.signal 

(B) LOCATIONS. .3 19 

(D) OTHER INFORMATION :/f unction= "3' untranslated region 
25 of wSBE I-D4 cDNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGACTTCTG GTTCCAAAAA GGCGTCTACA GGAGGTGACT CCAGCAAGAA GGGAATTAAC 60 
TTTGTCTTCG GGTCACCTGA CAAAGATAAC AAATAAGCAC CATATCAACG CTTGATCAGA 120 
ACCGTGTACC GACGTCCTTG TAATATTCCT GCTATTGCTA GTAGTAGCAA TACTGTCAAA 18 0 

3 5 CTGTGCAGAC TTGAGATTCT GGCTTGGACT TTGCTGAGGT TACCTACTAT ATAGAAAGAT 24 0 

AAATAAGAGG TGATGGTGCG GGTCG AG TCC GGCTATATGT GCCAAATATG CGCCATCCCG 3 00 
AGTCCTCTGT CATAAAGGA 319 

40 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4890 base pairs 

(B) TYPE: nucleic acid 

4 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 0 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 

(vi) ORIGINAL SOURCE: 
5 5 (A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

(ix) FEATURE: 



wo 
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(A) NAME/ KEY: promoter 

(B) LOCATION: 1 ..4890 

(D) OTHER INFORMATION:/function= "promoter containing 
sequence of SBE I" 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGTGGCGGG TCGGGCGGCA AGGCGCGGGG CGGCGGGGCG GCCGGGGCGG CGCGGCGGCG 60 
10 CGGGCGGCAG CGGCGGCTAG GGTTTCGCGG CGGCGGCGAC TTGGGCTGAG GCGGGGCACG 120 
GGCTGCGGCT TTAAAGGCCG GCCAGGCTGA GGTGTCCGGG TCGGACACGG CCCGTAAGGC 180 
GGTTGACTTT AAAAAATAAT AATTCGGACA TGCAAAAAAG TAAGAAAAGA AATAATAAAC 2 40 

15 

GGACTCCAAA AATCCCGAAG TAAATTTTTC CCCATTCTTA AAAATAAGCC GGACAAGATG 3 00 
AACATTTATT TGGGCCTAAA ATGCAATTTT GAAAAATGCG TATTTTTCCT AATTCGGAAT 3 60 
2 0 AAAATCAAAT AAAATCCAAA TAAAATCAAA TATTTGTTTT TAATATTTTT CCTCCAATAT 420 
TTCATTATTT GTGAAGAAGT CATTTTATCC CATCTCATAT ATTTTGATAT GAAATATTTT 480 
CGGAGAGAAA AATAATTAAA ACAAATGATC CTATTTTCAA AATTTGAGAA AACCCAAATA 54 0 
TGAAAATAAC GAAATCCCCA ACTCTCTCCG TGGGTCCTTG AGTTGCGTGA AATTTCTAGG 600 



in 25 

t5 ATCACAAATC AAAATGCAAT AAAATATGAT ATGCATGATG ATCTAATGTA TAACATTCCA 660 



3 0 ATTGAAAATT TGGGATGTTA CATATAACTC AAATTCTATA ATTATGAACA CAGAAATATT 72 0 

AATGTAGAAC TCTATTTTGT TTTGAAATTG TATTATTTTT TAGAATTAGT CTAGAGCATT 780 

TCGTGAACTT GAATCAAACC TTTAAATAAA ACAAAGCATA AAAATGACAA ATTCACATAT 84 0 

35 

GAAATAACTT GTGTTACATA GAT TT ATT AC AATAGCGTTG TATGTGTGTA TGTGTGCGTG 900 

AGTGCCTATG GTAATATCAA TAAATATCTT GATAGATGTT TCTACAATTC ACGGGTCTAA 9 60 

4 0 CTAGTAATGC AATGCAATGC ATGCTAAAAG AATAGAACCT TAGTTTCATT TAACTAACAA 102 0 

TTTTCAAATG TATGAGTTGC CAACAAGTGG CATACTTGGC ACTG TTTGTT TGTTCATTTT 108 0 

ATGGAAAGTT CTTCTCTTTT TACATGGTTT AG ATTCC AGC ATGTAGCCAC AAAATATGAT 114 0 

45 

TGTCAAAAGA TAATACCTCA TAATACAATT CCACTAAAGT CACCTAGCCC AAGTGACCGA 12 00 

CCTGATCCTG AAATAAAATC AGAAGATTTG GTGTCATCAT CATGACAACA AATTATTAGG 1260 

5 0 CGGTAGATCT TGTGGTAGTA CTCATGATGT AAAATTATCA AGAGGGAGAG AATGTATGGA 132 0 

GATTTATGTG AAGTACATCG TACACCAGAC ATAGTTGACA CATCGATTTT TTAAGATACA 13 80 

TTTGGACGCG CCTTGTGGGA GTGTAAAGTA CTACCATGTA TTAGAAGAGG TGAAATGAGA 1440 

AATGCCATAG CTAGCAAGTA GGCCTAGTTA AGGAAATTCT TCCTTAGATC CCCTTCTCCC 150 0 

GAAGAGTGAA GTGCTTCAAC TAAAGGTTAG ACCCACTTAA AAAATGTCAC TTTGAATCTT 1560 

6 0 TGCTTCCCTT GTCGTAATCC TGTGCATTTG TAGGTCCCTC GGATCTGAGC CCTTTCTCCA 162 0 

AGCCCTTCAT TGGATTCCCC TGGATGTCTT TTTGTTACAT TTTATTGAAG TGAGAGTGAA 16 8 0 

TTATTATATG CCCATAGGAG GTGGGATATA AAGGCTGTTG GTATTCTGCA CCATACATGC 174 0 

65 



55 
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TAGAGTAGGG AGGAGAGGCT GGTGCATGAT ACATGGTGGA CTAGCCCATA TATTTACCCC 1800 
TCCCCCACCC ACTAACAAGT TTTTTTTATT AGGTCTTCAT CCTCTGATTT GTTTTTCTGT I8 60 
5 TAGCCCATTC TTCATCATGG ACTTATTAAT CATGATTAGT TTCTTGGATT TTTGTTTACT 19 2 0 
TGACTTGAAT TTGACAATGT GCCTCATATA TGGCATGTGG GACTGATAGG AAGATATATT 19 80 
CTCACAACAT TAACTTAAAA AGGATTATTT TTTTGGTGCA GTCGTAAAGA AAACTACTTT 2040 
CTTTTATGCT AAAAGTTATT CAAACATAGA TTTATAAACA AAGGATATCA CCATGCATGA 2100 
CCATGCGCTC TCTCATGTTT ACTCTAGAAA CCATATATCT CTTTGTTGCA AAATATTTAA 2160 

15 TCTATCCTCC TTGTTTCTGG GAATGAGTCG GGGAAGGTAA TCTTAGGGAA GGTTAAAGTG 2220 
AGGCAAGTAA GAGCAACTCT AGCAGAGTCG CGATATGCCC AATCGCCATA ATGCCAATAT 22 80 

20 GGCATTT1W?G GCCCAAAATG GCACTTCAGA AGAGTCACCA TATCCCTTCG GATAGCCATA 23 40 
ATTTAGGGAG CTCGCTCCAC AAACAAGCTT CGAGCCTCCA AATATGGAGG CCATGGATTC 2400 
GTTGTTTGGC ACTCACTCCA TATCCAACCG CAAGCGCATG CATGAGGGAA GTTTTAGCTT 2460 

2 5 CTTCCTCCTT GCGCCAACGC CGGGATTTTA CACAGCGCAT TACAGGTACA TGAACCAGCA 2 52 0 

TGCACAGATA ATCACCGACG AGTGGGGTGA CAAGAAGGAT AAGCACCCTC CCATTAGTGG 2580 
TGCGCCCACT CCCCTCAAAT TCATGAGGCA GCCATTTGGA TGGTCATCGC GTGGCATAAG 2 64 0 
CTCCGACTAT AAAATCTCAA CGGCATCACC AAAACCATAG CTGCCGCCTC CCCCTTCCTC 2700 
GGCATCACCT CCCCAAGACA TCTCCTCCCC TCTATGCCAC AATGTCATCA TTATGGAGAG 27 60 

3 5 ACACAACTAC TGGTAAACCG CATACCCAAT CATGGTTTAC CGGCAGTGCG AACCCCACCT 2 82 0 

TCCTCCCACG ATGGTAGGAT ATTCTCCTCC TAGAATGGCG CGTGTGGCGC TTCCTCCTCC 2 880 

4 0 CGAGGCTGAT ATGTCGGCTC CCATGATGGC GTGCATCATT GATTTGGCGC TTCGGGTCCA 29 4 0 

TCATACATGT TAACGAGGTC ATCCCCATTG ATGTCGTTGG TCCCCTTGCC CCCCAGTCGG 3 000 
ATCCTGAGGA CCCGTTCGAT GTCGCAATGC GACTCTCCAA ACTCAAAGCT CACAATGAGG 3 060 

4 5 AGTACGTCCT CTAGGAGTTC CGCCCCGCAA CCATCTATAA GGAGGAGCAA CGATAGCTCT 312 0 
CCCCTACGCC TTCCTCGACG ATCTCTCTTA GGAGGACAAC GGCTAGACGA CGGCGGCGGC 3180 

^ GGCGAAGGTA CTGCAGGTAG TAGAACATAG CAATGTCGAA TGGCGACATT GCATATTTTG 324 0 
AAAATGTCGC TCAACGACTT TTGAAGTCGC AAATAAAATG TAGTGTGACT ACTTTTGGCC 33 00 
AGCAATATAA GTTTATCACA TTTGATAATG ATTTGAACCG GTGTGGTTCA ACTAAATGTA 33 60 

55 CCATAAATTG AACATACAAA TTTTTAGCAA ATGAAAAAAG AAACAAGTAA GACCACAAAT 3 42 0 
ATGAAAGCCG CATATCGCGA CTATGTGTTT GAGCCGCAGC TGCCAAGTAC ATATGAAGCG 3 48 0 
TACTCCATAT GACATACGAC AACCATACAT ATGAAGACTC TACTAGAGTT CTCTAAGGCC 3 540 

o u 

GCTTTTAGCG CCTTTCGTGC AGTGGTGCCC ATAGGGAGTG AGGGTAGTTG GACTGTTCGT 3 600 
TTCCCCTTTT TTCATTTCTT TGAAATCTAT TTTATTTTTT TTC TCTTTTG TAGGTTTCCC 3 66 0 
65 AAATTTATAT ACCATTTTTC TGTTTCTCGC TATTTTTTGT TGTTATATTC TAGTTTCATA 3 720 
TTTTTCTATT ATTAATTTGT GTCTCTTATG AGAAGTCCAG ACTTGCATAT GGAGGTGCAC 3 780 
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ACACAAACAT 


ATAAAGTATA 


AATACTAACT 


TGAGAAGTAT 


GTTTGCGTGG 


TCAAAAAAAC 


3840 






ATCATCAAAA 


CCTGCCAATA 


TGAGATATAG 


TTTTGAATAT 


ATCAATATGA 


GCAACGCAAC 


3900 






CATTTAAAAT 


GTGAACAATT 


G TTTTTTT AG 


AAAAAATATA 


AGAAATAACT 


CCAACCCAGC 


3960 


* 




CAAACCACAT 


GCTATACACT 


TGCTCCATAT 


GAAACCATGT 


TTGCTATTGG 


GCA.GTTGCCT 


4020 




10 


GAAACCGAAA 


GTAATGTTAG 


CCGTTTTTCT 


ATTCAAAGAA 


GAAGGAGAGT 


CGAGGTGACG 


4080 






CGATGCTTAG 


ACGTGAGATG 


GGGATGACCA 


CAACGTCCCT 


ACAGAGACCT 


CACCGGAGAT 


4140 




15 


GGGGACATTG 


CAGTTGACAC 


GAG AG C GGTG 


AGGGGCTGCG 


ATGCGTGTGC 


GGCAACATGT 


4200 




GGCGAGGCGG 


ACGTCGGGCT 


GGCAGGTAGG 


GGGGAGGGGG 


AAGGACCGGG 


GGAGGAAGAA 


4260 






GAGGAGTAGC 


CTGCAAAACA 


TGGTACACCA 


GTTTTCTGCC 


CTACGAAAAC 


CTCATTTCAT 


4320 




20 


TCCCCCACCC 


TGACAAGCAA 


CAACCAACCA 


TCGCAGTCCC 


ACATGTCCCT 


CTGGTCTTTG 


4380 






CAAAAAGTAA 


TTGTTCTTGC 


TGGACAGCGC 


AAAGAGTAAA 


CTTTTGTTAG 


TTTTCATTTC 


4440 


o 


25 


TAGAAAAAGC 


AATCCTTTTA 


TAGTTCTTTT 


GTGAAAGTAA 


TGC TTTTAT A 


GTGATTGGGA 


4500 


;S 


TGTTCTTTTA 


GAGCAAATAT 


CTTCTTTTTT 


TTTTAGGGAA 


AAGAGCAAAT 


ATCTTCCACT 


4560 






TTTCACAAAA 


CTGACGAAGG 


CTGAAAGTGG 


CGAGACAGTG 


AGGGCC CAT A 


GCTTTCGTCC 


4620 


m 


30 


GGCCCAGCGG 


CGCACGACCG 


TCCACGTGCA 


CCCCGGCCCT 


CCCGGGCCCG 


CAGATCCGTT 


4680 






CTCCCTCGCC 


CCCGTTTCCC 


CCTCCCTCCC 


TCTCGTTGCT 


TCCACTCCAC 


TGTTCTCCTC 


4740 




35 


TTCCTGTCCA 


AAGCGGCCAC 


GGACCGGAAA 


AAAATCACGC 


CTTTCCGTTG 


GGTCTCCGGC 


4800 




GCCACACTCC 


TCCTCCGGCC 


GATATAAAGC 


GCGCGGGGCC 


ACGGGCCCGG 


CGCAAAATGG 


4860 






GATTCCCGTC 


CGCCGCCATG 


GAGGAAGATG 


4890 







4 0 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

4 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

50 

(iv) ANTI-SENSE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triiicum tauschii 

5 5 (F) TISSUE TYPE: Endosperm 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIONS 

60 (D) OTHER INFORMATION:/product= "coding region of wSBE I-D4 gene" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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ACGGGCCCGG CGCAAAATGG GATTCCCGTC CGCCGCCATC GACGAAGATG CTCTGCCTCA 60 
CCGCCCCCTC CTGCTCGCCA TCTCTCCCGC CGCGCCCCTC CCGTCCCGCT GCTGACCGGC 12 0 
CCGGACCGGG GATCTCGGTG AGTCAGTCGG GATCTTCATT TCTTTTCTTT TCTTTCGTTT 180 
CCGGCTCCGT TCTGCCGGGG TTTCCCTGAT GCGATGCCGC GCGCGCGCAG GGCGGCGGCA 24 0 
ATGTGCGGCT GAGCGCGGTG CCCGCGCCCT CTTCGCTCCG CTGGTCGTGG CCGCGGAAGG 300 
TGAGCCCTCT CCCCTGTCTA CCCAGATTTG CGACCGTGAT CCCCTGTTGT CGCCGGGCAA 3 60 
ACGGAATCTG ATCCACGGTG GTTATTGGAA ATAGTATATA CTACTAATAA ACTTGAGGCT 42 0 
15 GGGATTCGTC CACTGAGGAA CAAGTGGATG CGATTTCGAT TGGATTTCTC TC-CTTTATGC 480 
GATCCGTACG CAGAATATCC CTCCTGCAGT GTCTCAACCG TATTACTGGA TC-TACAACCC 540 
20 AAATGTGTAT AATCTGTGCT GAATGTATCA ACCAATAATT GCTGCATTGT GAAAACATAA 600 
TCCTGTGTTG TGTCTCTACT ACTTGTTCAG TCCTGATCTG CCGCTTATCC TAACTTTTGT 660 
TCATTTATGG AAGGCCAAGA GCAAGTTCTC TGTTCCCGTG TCTGCGCCAA GAGACTACAC 720 
25 CATGGCAACA GCTGAAGATG GTGTTGGCGA CCTTCCGATA TACGATCTGG ATCCGAAGTT 78 0 
TGCCGGCTTC AAGGAACACT TCAGTTATAG GATGAAAAAG TACCTTGACC AGAAACATTC 840 
GATTGAGAAG CACGAGGGAG GCCTTGAAGA GTTCTCTAAA GGTTAGCTTT TGTTTCATGT 900 
GTTTGAAACA ATAGTTACAT CTTGTGGCGT CCGCAGCACA AAAGACATAA TGCGACTCTG 9 60 
TTTTGTAGGC TATTTGAAGT TTGGGATCAA CACAGAAAAT GACGCAACTG TGTACCGGGA 102 0 
3 5 ATGGGCCCCT GCAGCAATGT AAGTTCTAGT GTTGTCACGC AACTAATTGC AATGGTCGTT 1080 
GGTTAACTTA TGAAGTGCTG ATGAAACTGT CTTAAGAGTT TATGGCTTGT CTTTTCTGAT 1140 
^ o TCTAGCTAGT AAAGAGTAGA TAAATATGAA ATATGTTTTC C CTTTTCT AG TTATGGTCAT 12 00 
GGTTGGCTGG TATTCATTTC TTTTATGGCA ATACTTGCTT CTAACTATCT TTAGTAGATT 12 60 
CATGTATTTA CTTGTGAGTC ATTACTTTAT GGGTGTAGGG ATGCACAACT TATTGGTGAC 13 2 0 
45 TTCAACAACT GGAATGGCTC TGGGCACAGG ATGACAAAGG ATAATTATGG TGTTTGGTCA 13 8 0 
: ATCAGGATTT CCCATGTCAA TGGGAAACCT GCCATCCCCC ATAATTCCAA GGTTAAATTT 1440 
5q CGATTTCACC GTGGAGATGG ACTATGGGTC GATCGGGTTC CTGCATGGAT TCGTTATGCA 1500 
ACTTTTGATG CCTCTAAATT TGGAGCTCCA TATGACGGTG TTCACTGGGA TCCACCTTCT 1560 
GGTGAAAGGT CTACTTTTAG TGGCTCGAGA GCAAGAAATC TAAGTAAAAC CCACACAATT 1620 
55 AACTTACATT AATGTGGAGA CATGATACTT TTATTGCTCG TTTTGCAGGT ATGTGTTTAA 1680 
GCATCCTCGG CCTCGAAAGC CTGACGCTCC ACGTATTTAC GAGGCTCATG TGGGGATGAG 1740 
TGGTGAAAAG CCTGAAGTAA G C AC AT AC AG AGAATTTGCA GACAATGTGT TACCGCGCAT 1800 
AAAGGCAAAC AACTACAACA CAGTTCAGCT GATGGCAATC ATGGAACATT CATATTATGC 1860 
TTCTTTTGGG TACCATGTGA CGAATTTCTT CGCAGTTAGC AGCAGATCAG AACGCCAGAG 1920 
65 ACCTCAATAT CTTGTTGACA AGGCACATAG TTTACGGTTG CGTGTTCTGA TGGATGTTGT 1980 
CCATAGCCAT GCGAGCAGTA ATAAGACAGA TGGTCTTAAT GGCTATGATG TTGGGCAAAA 2040 
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CACACAGGAG TCCTATTTCC ACACAGGAGA AAGGGGCTAT CATAAACTGT GGGATAGCCG 2100 
CCTGTTCAAC TATGCCAATT GGGAGTCTTA CGATTTCTTC TTTCTAATCT GAGATATTGG 2160 

5 

ATGGACGAAT TCATGTTTGA TGGCTTCCGA TTTGATGGGG TAACATCCAT GCTATATAAT 222 0 
CACCATGGTA TCAATATGTC ATTCGCTGGA AGTTACAAGG AATATTTTGG TTTGGATACT 2 2 80 
.10 GATGTAGATG CAGTTGTTTA CCTGATGCTT GCGAACCATT TAATGCACAA ACTCTTGCCA 23 4 0 
GAAGCAACTG TTGTTGCAGA AG ATG TTTC A GGCATGCCAG TGCTTTGTCG GTCAGTTGAT 2400 
GAAGGTGGAG TAGGGTTTGA CTATCGCCTG GCTATGGCTA TTCCTGATAG ATGGATCGAC 2460 

15 

TACTTGAAGA AC AAAG ATG A CCTTGAATGG TCAATGAGTG GAATAGCACA TACTCTGACC 2 52 0 
AACAGGAGAT ATACGGAAAA GTGCATTGCA TATGCTGAGA GCCATGATCA GGTATGTTTT 25 80 
20 CCCTCCTTTG TCGCTGTGCG TGAGTATGTG TTCTTTTTTT ATGGGGCACT GGTCTAAGAA 2640 
CATACAGTTC AAAGGTGAGA CACTTTCTTT GCCTGGTAGA CAAATTTGAG AAATAAACAT 27 00 
TTCGCTTGAT GACTTTTAGT TGCTTCACAA GTTCGAATTA AGTTAGTTAT ATTCTGATAA 27 60 

25 

CTAGTGATAG TACCCACTAA CCAGCTATTA CGGACCATGT AAGAATGTCC GAAGACTGCA 282 0 
GTTATATATC GTTGACTTTG TGTTCATCTA TTGAAACAAC TTAGTAGTTA ACTTTCACGC 28 80 

3 0 AAATTTTCAG TCTATTGTTG GCGACAAGAC TATGGCATTT CTCTTGATGG ACAAGGAAAT 294 0 

GTATACTGGC ATGTCAGACT TGCAGCCTGC TTCGCCTACA ATTGATCGTG GAATTGCACT 3000 
TCAAAAGGTT CGATTCGTTT TAAGTATTCC TGAATTTGAT GTTCTAGTTC CAGACGAGTA 3 060 
TTGTAATGTT CGTTGTTACT CAGAGTTCTG CTTAGTCCTT GAAGATAATG TATTCCAGTC 312 0 
CCTTTTGGTA CATTTGGCTT ATTTTGTTAC AAATATTTCA GATGATTCAC TTCATCACCA 3180 

4 0 TGGCCCTTGG AGGTGATGGC TACTTGAATT TTATGGGTAA TGAGGTAATA TCTGGTTATC 3240 

TGTCAAAACT TATTTCTGAT CAATATGTTT CGGGATTCCC TCGAAAAAAA TCCTTTGGGC 3 3 00 
AGGGCGAAAA GTTTAAACAT CTGTTTTCTA TGATAGCCAA GTACTCCCCA GCTATTTCCA 3 3 60 

45 

TGTTATCACG TATCATTTAG CTGTGCCGGT AG TT AATCTT TATTCTAATT CATTGTTGTT 3 42 0 
TTTTAGCGTG GCAGTCTATT GTTGGATCCT CTTATTCCAA TTACATATAT GCCGACATCA 348 0 
50 CACACTTATG AATATTCCCT GTTTAAAAGA TTTTTATTTT ATACCAATGT TTCTCCGTAA 3 54 0 
ATGATGCAAA CATGATAGAG ATGTTAGCAT GTCTTTCTTA ACCTACTCAT GTTTTACATA 3 600 
TCACGACAAG CTTCTTGCAG AAAATCAGCA GTATATGGCA AATTGCTGCA ACCTGACAAC 3 660 

55 

GTTTATATCT GTTTTCTAAC TCATACTGAC GGTGCAATTT CCTTTTAGTT TGGCCACCCA 372 0 
GAATGGATTG ACTTTCCAGA AGAAGGCAAC AACTGGAGTT ATGATAAATG CAGACGCCAG 3 7 80 
60 TGGAGCCTCG CAGACATTGA TCACCTACGA TACAAGGTTA TGCCTATGTA TATTTTTACA 3 84 0 
GTTTCTGGTC TGGTAGCTCT CTTGGGATCT TGACCTCACT TAGTTCCTTC ATCTCTGACT 3 9 00 
GTAGCTTATT TACACTGTGT TCCAACTTCT GTCTTGTGGA TAAATTCTCC CTTCTAACGT 3 9 60 
TTCATATTAA GCCTTTCAAA CTAAACTAAA TTGCTGATCT ACT AC T AGTT GCTCAGTACG 4 02 0 
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ATGACCAAAT CTTGCCTGTG GTAACCTAGT AATTTTCTTG ATTCTTACAC ATTAGTGATA 4 0 80 
TGCAGTGCAT ACATTATCCA TATAAATTGA CATTGCAATT TCCCAAATAT TATTTGAAGG 4140 
5 CTG7GTTCTT TTGTTAACAG GAAGTTATTT TCTCTGCATC TGATAAATAA TAATAGCCTT 4 2 00 
TCACGATTTT TCTCATATTT TATCCAACTT TTCTGCATTC AAGCATTTTT TGTTTCTCGC 4 2 60 
iq C T AAC ATATA TAATTTGAAC AGTACATGAA CGCATTTGAT CAAGCAATGA ATGCGCTCGA 43 20 
CGACAAATTT TCCTTCCTAT CATCATCAAA GCAGATTGTC AGCGACATGA ATGAGGAAAA 43 8 0 
GAAGTAGTTA ACTATACAAT GTTTAGTCAG GGCAGCTGTT GCATCATTTG ATTCACTCCT 44 4 0 
15 ACTCTTAAGA ATAGCAACTC TGACTTGTGC GTTTTATGTT ACCAAATAAG TTGAAACCGT 4 500 
ATC7GTTTGA TATGAACCAT TGTTGTCTCA AAATGGGCTA TGGACTCAAT CCAACTTCCT 4560 
TTCCAGATTA TTGTATTTGA ACGTGGAATC TGGTCTTCGT CTTCAATTTT CATCCCAGTA 4620 
AAACTTATGA TGGGTAACTG ATCTCTTGCA AGCTTTGCCT TTCAATATTT CTTCTGCTTA 4680 
ATGACTAATG TGCTTAATCT CGTTTCCACT TTTAAAACAC GCAGTTACAA AGTCGGATGT 474 0 
25 GACTTGCCTG GG AAGTACAA GGTAGCTCTG GACTCTGATG CTCTGATGTT TGGTGGACAT 4 8 00 
GGAAGAGTAA GCAATGTTAA TGATGTTCAA GATCTGTTTT GCAACACTAT GTTCTTCTAT 48 60 
3 0 AGAAGGGGCC ATCAAGGCTG CATCAGATAA TCTTATTTGC AGTGTTGATC TGTGCTGCAT 4920 
CGCAGGTGGC CCATGACAAC GATCACTTTA CGTCACCTGA AGGAGTACCA GGAGTACCTG 49 80 
AAACAAACTT CAACAACCGC CCTAACTCAT TC AAAATCC T GTCTCCATCC CGCACTTGTG 504 0 
3 5 TGGTAATGCT AATTACTAGG AGGATTTAGT AACAATAAAT AAATAACAGC AAAAGATATC 5100 
TGCAGTACGA TCTCACAAAA TGCTCTCTTG CCAGGCTTAC TATCGCGTCG AGGAGAAAGC 5160 
^ GGAAAAGCCC AAGGATGAAG GAGCTGCTTT CTTGGGGGAA ACTGCTCTCG GGTACATCGA 522 0 
TGTTGAAGCC ACTGGCGTCA AAGACGCAGC AGATGGTGAG GCGACTTCTG GTTCCGAAAA 52 80 
GGCGTCTACA GGAGGTGACT CCAGCAAGAA GGGAATTAAC TTTGTCTTTC TGTCACCCGA 53 40 
45 CAAAGACAAC AAATAAGCAC CATATCAACG CTTGATCAGG ACCGTGTGCC GACGTCCTTG 5400 
TAATACTCCT GCTATTGCTA GTAGTAGCAA TACTGTCAAA CTGTGCAGAC TTGAAATTCT 54 60 
GGCTTGGACT TTGCTGAGGT TACCTACTAT ATAGAAAGAT AAATAAGCGG TGATGGTGCG 5520 
GGTCGAGTCC AGCTATATGT GCCAAATATG CGCCATCCCG AGTCCTCTGT CATAAAGAAA 55 80 
GTTTCGGGCT TCCATCCCAG AATAAAAACA GTTGTC TGTT TGCAATTTCT TTTTGTCTTG 5 640 
55 CATAGTTACA TGATAATTGA TGCATATTGC TATAAGCCTG GATTGCATCT TCTTTTGCTA 5700 
ATAACTGCAG GGCCAAGAAA GCCTAGATTG TATCTTTTTT TGCTAATAAC TGCAGTGCTG 57 6 0 
GGGAAGCTTC AGTCCTTGTT TCCGTTCTCG AGACAAGGCG TCATGTTTGG CGCACAAAGG 5820 
TAAGCCATCA TCTTATCAAG TCCCAAAATT CTCTGGTTGA AAGAAACCAT CACTAACTTG 5880 
TTCCAGGTGT TGGTTCCTCC ACAACCAAAA GGCGACCATC GTCGTCATCA TCGCTCACAG 594 0 
6 5 CACTGACCAT CGAAGCCACG GTGGGCATGA AATGCGCATC GCCCAAGACT TGGGACCGTT 6000 
TCAAAATATC ACAAACTGCC ATGGCATCTT CTGCCAAAGG CTGCACTGCA CCTTTGGCAT 6060 
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GAACAGAAGC AAC AGGGGCT TGGAACTGAA CGCCGAAAAT AAAGTCAAAC CGGCTGGGCC 612 0 
GGATTGAAAG GGGAAACGCC AAAATCCACT TAATTTGAAT GGAAGGAGGA ATGGTTCTTG 6180 

5 

CTGGTTTCAA CTCTGCAGGC TTCCCTCTGA ATTTCACACG GAGCCATT 6228 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 
1 0 (A) LENGTH: 1 1463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 5 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 

20 

« (vi) ORIGINAL SOURCE: 

(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

2 2 5 (ix) FEATURE: 

^ (A) NAME/KEY: misc.feature 

p=J (B) LOCATION: 1„1 1463 

y z (D) OTHER INFORMATION:/produci= "complete sequence of the 

^ starch branching enzyme II gene" 

3! 30 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ill 

q AGAAACACCT CCATTTTAGA TTTTTTTTTT GTTCTTTTCG GACGGTGGGT CGTGGAGAGA 60 

;5f 3 5 TTAGCGTCTA GTTTTCTTAA AAGAACAGGC CATTTAGGCC CTGCTTTACA AAAGGCTCAA 120 

G CCAGTCCAAA ACGTCTGCTA GGATCACCAG CTGCAAAGTT AAGCGCGAGA CCACCAAAAC 180 

AGGCGCATTC GAACTGGACA GACGCTCACG CAGGAGCCCA GCACCACAGG CTTGAGCCTG 240 

40 

ACAGCGGACG TGAGTGCGTG ACACATGGGG TCATCTATGG GCGTCGGAGC AAGGAAGAGA 3 00 

GACGCACATG AACACCATGA TGATGCTATC AGGCCTGATG GAGGGAGCAA CCATGCACCT 3 60 

45 TTTCCCCTCT GG AAATTC AT AGCTCACACT TTTTTTTAAT GGAAGCAAGA GTTGGCAAAC 42 0 

ACATGCATTT TCAAACAAGG AAAATTAATT CTCAAACCAC CATGACATGC AATTCTC AAA 4 80 

CCATGCACCG ACGAGTCCAT GCGAGGTGGA AACGAAGAAC TGAAAATCAA CATCCCAGTT 54 0 

50 

GTCGAGTCGA GAAGAGGATG ACACTGAAAG TATGCGTATT ACGATTTCAT TTACATACAT 600 

GTAC AAAT AC ATAATGTACC CTACAATTTG TTTTTTGGAG CAGAGTGGTG TGGTCTTTTT 6 60 

5 5 TTTTTACACG AAAATGCCAT AGCTGGCCCG CATGCGTGCA GATCGGATGA TCGGTCGGAG 7 20 

ACGACGGACA ATCAGACACT CACCAACTGC TTTTGTCTGG GACACAATAA ATGTTTTTGT 7 80 

AAACAAAATA AATACTTATA AACGAGGGTA CTAGAGGCCG CTAACGGCAT GGCCAGGTAA 84 0 

ACGCGCTCCC AGCCGTTGGT TTGCGATCTC GTCCTCCCGC ACGCAGCGTC GCCTCCACCG 900 
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TCCGTCCGTC GCTGCCACCT CTGCTGTGCG CGCGCACGAA GGGAGGAAGA ACGAACGCCG 9 60 
CACACACACT CACACACGGC ACACTCCCCG TGGGTCCCCT TTCCGGCTTG GCGTCTATCT 1020 
CCTCTCCCCC GCCCATCCCC ATGCACTGCA CCGTACCCGC CAGCTTCCAC CCCCGCCGCA 1080 
CACGTTGCTC CCCCTTCTCA TCGCTTCTCA ATTAATATCT CCATCACTCG GGTTCCGCGC 1140 
TGCATTTCGG CCGGCGGGTT GAGTGAGATC TGGGCGACTG GCTGACTCAA TCACTACGCG 1200 
GGGATGGCGA CGTTCGCGGT GTCCGGCGCG ACTCTCGGTG TGGCGCGGGC CGGCGTCGGA 12 6 0 
GTGGCGCGGG CCGGCTCGGA GCGGAGGGGC GGGGCGGACT TGCCGTCGCT GCTCCTCAGG 13 2 0 
AAGAAGGACT CCTCTCGTAC GCCTCGCTCT CTCGAATCTC CCCCGTCTGG CTTTGGCTCC 13 80 
CCTTCTCTCT CCTCTGCGCG CGCATGGCCT GTTCGATGCT GTTCCCCAAT TGATCTCCAT 1440 
GAGTGAGAGA GATAGCTGGA TTAGGCGATC GCGCTTCCTG AACCTGTATT TTTTCCCCCG 1500 
CGGGG AAATG CGTTAGTGTC ACCCAGGCCC TGGTGTTACC ACGGCTTTGA TCATTCCTCG 1560 
TTTCATTCTG ATATATATTT TCTCATTCTT TTTCTTCCTG TTCTTGCTGT AACTGCAAGT 1620 
TGTGGCGTTT TTTCACTATT GTAGTCATCC TTGCATTTTG CAGGCGCCGT CCTGAGCCGC 1680 
GCGGCCTCTC CAGGGAAGGT CCTGGTGCCT GACGGCGAGA GGACGACTTG GCAAGTCCGG 1740 
CGCAACCTGA AGAATTACAG GTACACACAC TCGTGCCGGT AAATC TTC AT AC AATC GTT A 1800 
TTCACTTACC AAATGCCGGA TGAAACCAAC CACGGATGCG TCAGGTTTCG AGCTTCTTCT 1860 
ATCAGCATTG TGCAGTACTG CACTGCCTTG TTCATTTTGT TAGCCTTGGC CCCGTGCTGG 1920 
CTCTTGGGCC ACTGAAAAAA TCAGATGGAT GTGCATTCTA GCAAGAACTT CACAACATAA 19 80 
TGCACCGTTT GGGGTTTCGT CAGTCTGCTC T AC AATTGC T ATTTTTCGTG CTGTAGATAC 2 04 0 
CTGAAGATAT CGAGGAGCAA ACGGCGGAAG TGAACATGAC AGGGGGGACT GCAGAGAAAC 2100 
TTC AATC TTC AGAACCGACT CAGGGCATTG TGGAAACAAT CACTGATGGT GTAACCAAAG 2160 
GAGTTAAGGA ACTAGTCGTG GGGGAGAAAC CGCGAGTTGT CCCAAAACCA GGAGATGGGC 2220 
AGAAAATATA CGAGATTGAC CCAACACTGA AAGATTTTCG GAGCCATCTT GACTACCGGT 22 80 
AATGCCTACC CGCTGCTTTC GCTCATTTTG AATTAAG GTC CTTTCATCAT GCAAATTTGG 2340 
GGAACATCAA AGAGACAAAG ACTAGGGACC ACCATTTCAT ACAGATCCCT TCGTGGTCTG 2400 
AGAATATGCT GGGAAGTAAA TGTATAATTG ATGGCTACAA TTTGCTCAAA ATTGCAATAC 2 460 
GAATAACTGT CTCCGATCAT TACAATTAAA GAGTGGCAAA CTGATGAAAA TGTGGTGGAT 2520 
GGGTTATAGA TTTTACTTTG CTAATTCCTC TACCAAATTC CTAGGGGGGA AATCTACCAG 2 5 80 
TTGGGAAACT TAGTTTCTTA TCTTTGTGGC CTTTTTGTTT TGGGGAAAAC ACATTGCTAA 2640 
ATTCGAATGA TTTTGGGTAT ACCTCGGTGG ATTCAACAGA TACAGCGAAT ACAAGAGAAT 2700 
TCGTGCTGCT ATTGACCAAC ATGAAGGTGG ATTGGAAGCA TTTTCTCGTG GTTATGAAAA 27 60 
GCTTGGATTT ACCCGCAGGT AAATTTAAAG CTTTATTATT ATGAAACGCC TCCACTAGTC 2820 
TAATTGCATA TCTTATAAGA AAATTTATAA TTCCTGTTTT CCCCTCTCTT TTTTCC AGTG 2880 
CTGAAGGTAT CGTCTAATTG CATATCTTAT AAGAAAATTT ATATTCCTGT TTTCCCCTAT 2940 
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TTTCCAGTGC TGAAGGTATC ACTTACCGAG AATGGGCTCC CTGGAGCGCA TGTTATGTTC 3 000 
TTTTAAGTTC CTTAACGAGA CACCTTCCAA TTTATTGTTA ATGGTCACTA TTCACCAACT 3 060 

5 

AGCTTACTGG ACTTACAAAT TAGCTTACTG AATACTGACC AGTTACTATA AATTTATGAT 3120 
CTGGCTTTTG CACCCTGTTA CAGTCTGCAG CATTAGTAGG TGACTTCAAC AATTGGAATC 3180 
10 CAAATGCAGA TACTATGACC AGAGTATGTC TACAGCTTGG CAATTTTCCA CCTTTGCTTC 3 24 0 
AT AAC TACTG ATACATCTAT TTGTATTTAT TTAGCTGTTT GCACATTCCT TAAAGTTGAG 3 3 00 
CCTCAACTAC ATCATATCAA AATGGTATAA TTTGTCAGTG TCTTAAGCTT CAGCCCAAAG 3 3 60 

15 

ATTCTACTGA ATTTAGTCCA TCTTTTTGAG ATTGAAAATG AGTATATTAA GGATGAATGA 3 420 
ATACGTGCAA CACTCCCATC TGCATTATGT GTGCTTTTCC ATCTACAATG AGCATATTTC 3 480 

2 0 CATGCTATCA GTGAAGGTTT GCTCCTATTG ATGCAGATAT TTGATATGGT CTTTTCAGGA 3 540 

TGATTATGGT GTTTGGGAGA TTTTCCTCCC TAACAACGCT GATGGATCCT CAGCTATTCC 3 600 
TCATGGCTCA CGTGTAAAGG TAAGCTGGCC AATTATTTAG TCGAGGATGT AGCATTTTCG 3 660 

25 

AACTCTGCCT AC T AAGGGTC CCTTTTCCTC TCTGTTTTTT AGATACGGAT GGATACTCCA 37 20 
TCCGGTGTGA AGGATTCAAT TTCTGCTTGG ATCAAGTTCT CTGTGCAGGC TCCAGGTGAA 3780 

3 0 ATACCTTTCA ATGGCATATA TTATGATCCA CCTGAAGAGG TAAG TATCG A TCTACATTAC 3 840 

ATTATTAAAT GAAATTTCCA GTGTTACAGT TTTTTAATAC CCACTTCTTA CTGACATGTG 39 00 
AGTCAAGACA ATACTTTTGA ATTTGGAAGT GACATATGCA TTAATTCACC TTCTAAGGGC 3 960 

35 

TAAGGGGCAA CCAACCTTGG TGATGTGTGT ATGCTTGTGT GTGACATAAG ATC TTATAGC 4020 
TCTTTTATGT GTTCTCTGTT GGTTAGGATA TTCCATTTTG GCCTTTTGTG ACCATTTACT 4080 

4 0 AAGGATATTT ACATGCAAAT GCAGGAGAAG TATGTCTTCC AACATCTCAA CTAAACGACC 4140 

AGAGTCACTA AGGATTTATG AATCACACAT TGGAATGAGC AGCCCGGTAT GTCAATAAGT 42 00 
TATTTCACCT GTTTCTGGTC TGATGGTTTA TTCTATGGAT TTTCTAGTTC TGTTATGTAC 4260 

45 

TGTTAACATA TTACATGGTG CATTCACTTG ACAACCTCGA TTTTATTTTC TAATGTCTTC 4320 
ATATTGGCAA GTGCAAAACT TTGCTTCCTC TTTGTCTGCT TGTTCTTTTG TCTTCTGTAA 43 80 
50 GATTTCCATT GCATTTGGAG GCAGTGGGCA TGTGAAAGTC ATATCTATTT TTTTTTTGTC 4440 
AGAGCATAGT TATATGAATT CCATTGTTGT TGCAATAGCT CGGTATAATG TAACCATGTT 4500 
ACTAGCTTAA GATTTCCCAC TTAGGATGTA AGAAATATTG CATTGGAGCG TCTCCAGCAA 4560 

55 

GCCATTTCCT ACCTTATTAA TGAGAGAGAG ACAAGGGGGG GGGGGGGGGG GGGGTTCCCT 4 620 
TCATTATTCT GCGAGCGATT CAAAAACTTC CATTGTTCTG AGGTGTACGT ACTGCAGGGA 4 6 80 
60 TCTCCCATTA TGAAGAGGAT ATAGTTAATT CTTTGTAACC T AC TTGG AAA CTTGAGTCTT 47 4 0 
GAGGCATCGC T AA TAT AT AC TATCATCACA ATACTTAGAG GATGCATCTG AAATTTTAGT 4 800 
GTGATCTTGC ACAGGAACCG AAG AT AAATT CATATGCTAA TTTTAGGGAT GAGGTGTTGC 4 860 
CAAGAATTAA AAGGCTTGGA TACAATGCAG TGCAGATAAT GGCAATCCAG GAGCATTCAT 4920 
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ACTATGCAAG CTTTGGGTAT TCACACAATC CATTTTTTTC TGTATACACT CTTCACCCAT 4 9 80 
TTGGAGCTAT TACATCCTAA TGCTTCATGC ACATAAAATA TTTGGATATA ATCCTTTATT 504 0 
5 AGATATATAG TACAACTACA CTTAGTATTC TGAAAAAGAT CATTTTATTG TTGTTGGCTT 5100 
GTTCCAGGTA CCATGTTACT AATTTTTTTG CACCAAGTAG CCGTTTTGGA ACTCCAGAGG 5160 
ACTTAAAATC CTTGATCGAT AGAGCACATG AGCTTGGTTT GCTTGTTCTT ATGGATATTG 52 2 0 
TTCATAGGTA ATTAGTCCAA TTTAATTTTA GCTGTTTTAC TGTTTATCTG GTATTCTAAA 52 80 
GGGAAATTCA GGCAATTATG ATACATTGTC AAAAGCTAAG AGTGGCGAAA GTGAAATGTC 5 3 40 
15 AAAATCTAGA GTGGCATAAG GAAAATTGGC AAAAACTAGA GTGGCAAAAA TAAAATTTTC 54 00 
CCATCCTAAA TGGCAGGGCC CTATCGCCGA ATATTTTTCC ATTCTATATA ATTGTGCTAC 54 60 
2o GTGACTTCTT TTTTCTCAGA TG T ATT AAA C C AGTTGG AC A TGAAATG TAT TTGGTACATG 552 0 
TAGTAAACTG ACAGTTCCAT AGAATATCGT TTTGTAATGG CAACACAATT TGATGCCATA 5580 
GATGTGGATT GAGAAGTTCA GATGCTATCA ATAGAATTAA TCAACTGGCC ATGTACTCGT 5640 
25 GGCACTACAT AT AG TTTGC A AGTTGGAAAA CTGACAGCAA TACCTCACTG ATAAGTGGCC 5700 
AGGCCCCACT TGCCAGCTTC ATACTAGATG TTACTTCCCT GTTGAATTCA TTTGAACATA 57 60 
30 TTACTTAAAG TTCTTCATTT GTCCTAAGTC AAACTTCTTT AAGTTTGACC AAGTCTATTG 5 820 
GAAAATATAT CAACATCTAC AACACCAAAT TACTTTGATC AGATTAACAA TTTTTATTTT 58 80 
ATTATATTAG CACATCTTTG ATGTTGTAGA TATCAGCACA TTTTTCTATA GACTTGGTCA 59 40 

3 5 AATATAGAGA AGTTTGACTT AGGACAAATC TAGAACTTCA ATCAATTTGG ATCAGAGGGA 6000 

ACATCAAATA ATATAGATAG ATGTCAACAC TTCAACAAAA AAATCAGACC TTGTCACCAT 60 60 
ATATGCATCA GACCATCTGT TTGCTTTAGC CACTTGCTTT CATATTTATG TGTTTGTACC 6120 
TAATCTACTT TTCCTTCTAC TTGGTTTGGT TGATTCTATT TCAGTTGCAT TGCTTCATCA 6180 
ATGATTTTGT GTACCCTGCA GTCATTCGTC AAATAATACC CTTGACGGTT TGAATGGTTT 62 40 

4 5 CGATGGCACT GATACACATT ACTTCCACGG TGGTCCACGC GGCCATCATT GGATGTGGGA 63 00 

TTCTCGTCTA TTCAACTATG GGAGTTGGGA AGTATGTAGC TCTGACTTCT GTCACCATAT 63 60 
' TTGGCTAACT GTTCCTGTTA ATCTGTTCTT AC AC ATG TTG ATATTCTATT CTTATGCAGG 642 0 

TATTGAGATT CTTACTGTCA AACGCGAGAT GGTGGCTTGA AGAATATAAG TTTGATGGAT 64 8 0 

TTCGATTTGA TGGGGTGACC TCCATGATGT ATACTCACCA TGGATTACAA GTAAGTCATC 65 4 0 
55 AAGTGGTTTC AGTAACTTTT TTAGGGCACT GAAACAATTG CTATGCATCA TAACATGTAT 66 00 

CATGATCAGG ACTTGTGCTA CGGAGTCTTA GATAGTTCCC TAGTATGCTT GTACAATTTT 6660 
^ ACCTGATGAG ATCATGGAAG ATTGGAAGTG ATTATTATTT ATTTTCTTTC TAAGTTTGTT 67 2 0 

TCTTGTTCTA GATGACATTT ACTGGGAACT ATGGCGAATA TTTTGGATTT GCTACTGATG 67 80 

TTGATGCGGT AGTTTACTTG ATGCTGGTCA ACGATCTAAT TCATGGACTT TATCCTGATG 6 84 0 
65 CTGTATCCAT TGGTGAAGAT GTAAGTGCTT ACAGTATTTA TGATTTTTAA CTAGTTAAGT 6 900 

AGTTTTATTT TGGGGATCAG TCTGTTACAC TTTTTGTTAG GGGTAAAATC TCTCTTTTCA 6 960 
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TAACAATGCT AATTT AT AC C TTGTATGATA ATGCATCACT TAG T AATTTG AAAAGTGCAA 7 02 0 
GGGCATTCAA GCTTACG AG C ATATTTTTTG ATGGCTGTAA TTTATTTGAT AGTATGCTTG 70 8 0 

5 

TTTGGGTTTT TCAATAAGTG GGAGTGTGTG ACTAATGTTG TATTATTTAT TTAATTGCGG 714 0 
AAGAAATGGG CAACCTTGTC AATTGCTTCA GAAGGCTAAC TTTGATTCCA TAAACGCTTT 7 2 00 
10 GGAAATGAGA GGCTATTCCC AAGGACATGA ATTATACTTC AGTGTGTTCT GTACATGTAT 72 60 
TTGTAATAGT GGTTTAACTT AAATTCCTGC ACTGCTATGG AATCTCACTG TATGTTGTAG 73 2 0 
TGTACACATC CACAAACAAG TAATCCTGAG CTTTCAACTC ATGAGAAAAT AGAGTCCGCT 7 3 80 

15 

TCTGCCAGCA TTAACTGTTC ACAGTTCTAA TTTGTGTAAC TGTGAAATTG TTC AGGTC AG 7 44 0 
TGGAATGCCT ACATTTTGCA TCCCTGTTCC AGATGGTGGT GTTGGTTTTG ACTACCGCCT 7 50 0 
20 GCATATGGCT GTAGCAGATA AATGGATTGA ACTCCTCAAG TAAGTGCAGG AATATTGGTG 7 560 
ATTACATGCG CACAATGATC TAGATTACAT TTTCTAAATG GTAAAAAGGA AAATATG TAT 7620 
GTGAATATCT AGACATTTGC CTGTTATCAG CTTGAATACG AGAAGTCAAA TACATGATTT 7 6 80 

25 

AAAT AG C AAA TCTCGGAAAT GTAATGGCTA GTGTCTTTAT GCTGGGCAGT GTACATTGCG 77 4 0 
CTGTAGCAGG CCAGTCAACA CAGTTAGCAA TATTTTCAGA AACAATATTA TTTATATCCG 7 800 
3 0 TATATGAGAA AGTTAGTATA TAAACTGTGG TCATTAATTG TGTTCACCTT TTGTCCTGTT 7860 
TAAGGATGGG CAGTAGGTAA TAAATTTAGC CAGATAAAAT AAATCGTTAT TAGGTTTACA 7 92 0 
AAAGGAATAT ACAGGGTCAT GTAGCATATC TAGTTGTAAT TAATGAAAAG GCTGACAAAA 7 9 80 

35 

GGCTCGGTAA AAAAAACTTT ATGATGATCC AGATAGATAT GCAGGAACGC GACTAAAGCT 804 0 
CAAATACTTA TTGCTACTAC ACAGCTGCCA ATCTGTCATG A.TCTGTGTTC TGCTTTGTGC 8100 
40 TATTTAGATT TAAATACTAA CTCGATACAT TGGCAATAAT AAACTTAACT ATTCAACCAA 816 0 
TTTGG TGG AT ACCAGAATTT CTGCCCTCTT GTTAGTAATG ATGTGCTCCC TGCTGCTGTT 82 2 0 
CTCTGCCGTT ACAAAAGCTG TTTTCAGTTT TTTGCATCAT TATTTTTGTG TGTGAGTAGT 82 80 

45 

TTAAGCATGT TTTTTGAAGC TGTGAGCTGT TGGTACTTAA TACATTCTTG GAAGTGTCCA 83 4 0 
AATATGCTGC AGTGTAATTT AGCATTTCTT TAACACAGGC AAAGTGACGA ATC TTGG AAA 84 0 0 
50 ATGGG CG AT A TTGTGCACAC CCTAACAAAT AGAAGGTGGC TTGAGAAGTG TGTAACTTAT 84 6 0 
GCAGAAAGTC ATGATCAAGC ACTAGTTGGT GACAAGACTA TTGCATTCTG GTTGATGGAT 85 2 0 
AAGGTACTAG CTGTTACTTT TGGACAAAAG AATTACTCCC TCCCGTTCCT AAATATAAGT 858 0 

55 

CTTTGTAGAG ATTCCACTAT GGACCACATA G TAT AT AG AT GCATTTTAGA GTGTAGATTC 364 0 
ACTCATTTTG CTTCGTATGT AGTCCATAGT GAAATCTCTA CAGAGACTTA TATTTAGGAA 87 00 
60 CGGAGGGAGT ACATAATTGA TTTGTCTCAT GAGATTGCTA GTGTTTTCTT GTGATAAAGA 37 6 0 
TTGGCTGCCT CACCCATCAC CAGCTATTTC CCAACTGTTA CTTGAGCAGA ATTTGCTGAA 882 0 
AACGTACCAT GTGGTACTGT GGCGGCTTGT GAACTTTGAC AGTTATGTTG CAATTTTCTG 3 880 
TTCTTATTTA TTTGATTGCT TATGTTACCG TTCATTTGCT CATTCCTTTC CGAGACCAGC 894 0 
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CAAAGTCACG TGTTAGCTGT GTGATCTCTT ATCTGAATCT TGAGCAAATT TTATTAATAG 9000 
GCTAAAATCC AACGAATTAT TTGCTTGAAT TTAAATATAC AGACGTATAG TCACCTGGCT 9060 
5 CTTTCTTAC-A TGATTACCAT AGTGCCTGAA GGCTGAAATA GTTTTGGTGT TTCTTGGATG 912 0 
CCGCCTAAAG GAGTGATTTT TATTGGATAG ATTCCTGGCC GAGTCTTCGT TACAACATAA 9180 
iq CATTTTGGAG ATATGCTTAG TAACAGCTCT GGGAAGTTTG GTCACAAGTC TGCATCTACA 924 0 
CGCTCCTTGA GGTTTTATTA TGGCGCCATC TTTGTAACTA GTGGCACCTG TAAGGAAACA 9300 
CATTCAAAAG GAAACGGTCA CATCATTCTA ATCAGGACCA CCATACTAAG AGCAAGATTC 93 60 
15 TGTTCCAATT TTATGAGTTT TTGGGACTCC AAAGGGAACA AAAGTGTCTC ATATTGTGCT 9420 
TATAACTACA GTTGTTTTTA T AC CAGTGT A GTTTTATTCC AGGACAGTTG ATACTTGGTA 9480 
CTGTGCTGTA AATTATTTAT CCGACATAGA ACAGCATGAA CATATCAAGC TCTCTTTGTG 9540 
CAGGATATC-T ATGATTTCAT GGCTCTGGAT AGGCTTCAAC TCTTCGCATT GATCGTGGCA 9600 
TAGCATTACA TAAAATGATC AGGCTTGTCA CCATGGGTTT AGGTGGTGAA GGCTATCTTA 9660 
25 ACTTCATGGG AAATGAGTTT GGGCATCCTG GTCAGTCTTT ACAACATTAT TGCATTCTGC 9720 
ATGATTGTGA TTTACTGTAA TTTGAACCAT GCTTTTCTTT CACATTGTAT GTATTATGTA 9780 
3q ATCTGTTGCT TCCAAGGAGG AAGTTAACTT CTATTTACTT GGCAGAATGG ATAGATTTTC 9840 
CAAGAGGCCC ACAAACTCTT CCAACCGGCA AAGTTCTCCC CTGGAAATAA CAATAGTTAT 9900 
GATAAATGCC GCCGTAGATT TGATCTTGTA AGTTTTAGCT GTGCTATTAC ATTCCCTCAC 9960 
35 TAGATCTTTA TTGGCCATTT ATTTCTTGAT GAAATCATAA TGTTTG TT AG GAAAGATCAA 10020 
CATTGCTTTT GTAGTTTTGT AGACGTTAAC ATAAGTATGT GTTGAGAGTT GTTGATCATT 10080 
AAAAATATCA TGATTTTTTG CAGGGAGATG CAGATTTTCT TAGATATCGT GGTATGCAAG 10140 
AGTTCGATCA GGCAATGCAG CATCTTGAGG AAAAATATGG GGTATGTCAC TGGTTTGTCT 10200 
TTGTTGCATA ACAAGTCACA GTTTAACGTC AGTCTCTTCA AGTGGTAAAA AAAGTGTAGA 10260 
45 ATTAATTCCT GTAATGAGAT GAAAACTGTG CAAAGGCGGA GCTGGAATTG CTTTTCACCA 10320 
AAACTATTTT CTTAAGTGCT TGTGTATTGA TACATATACC AGCACTGACA ATGTAACTGC 10380 
5q AGTTTATGAC ATCTGAGCAC CAGTATGTTT CACGGAAACA TGAGGAAGAT AAGGTGATCA 10440 
TCCTCAAAAG AGG AG ATTTG GTATTTGTTT TCAACTTCCA CTGGAGCAAT AGCTTTTTTG 10500 
ACTACCGTGT TGGGTGTTCC AAGCCTGGGA AGTACAAGGT ATGCTTGCCT TTTCATTGTC 10560 
55 CACCCTTCAC CAGTAGGGTT AGTGGGGGCT TCTACAACTT TTAATTCCAC ATGGATAGAG 10620 
TTTGTTGGTC GTGCAGCTAT CAATATAAAG AATAGGGTAA TTTGTAAAGA AAAGAATTTG 10680 
go CTCGAGCTGT TGTAGCCATA GGAAGGTTGT TCTTAACAGC CCCGAAGCAC ATACCATTCA 10740 
TTCATATTAT CTACTTAAGT GTTTGTTTCA ATCTTTATGC TCAGTTGGAC TCGGTCTAAT 10800 
ACTAGAACTA TTTTCCGAAT CTACCCTAAC CATCCTAGCA GTTTTAGAGC AGCCCCATTT 10860 
65 GGACAATTGG CTGGGTTTTT G TTAGTTG TG ACAGTTTCTG CTATTTCTTA ATCAGGTGGC 10920 
CTTGGACTCT GACGATGCAC TCTTTGGTGG ATTCAGCAGG CTTGATCATG ATGTCGACTA 10980 
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CTTCACAACC GTAAGTCTGG GCTCAAGCGT CACTTGACTC GTCTTGACTC AACTGCTTAC 11040 
AAATCTGAAT CAACTTCCCA ATTGCTGATG CCCTTGCAGG AACATCCGCA TGACAACAGG 11100 
5 CCGCGCTCTT TCTCGGTGTA CACTCCGAGC AGAACTGCGG TCGTGTATGC CCTTACAGAG 11160 
TAAGAACCAG CAGCGGCTTG TTACAAGGCA AAGAGAGAAC TCCAGAGAGC TCGTGGATCG 11220 
TGAGCGAAGC GACGGGCAAC GGCGCGAGGC TGCTCCAAGC GCCATGACTG GGAGGGGATC 11280 

10 

GTGCCTCTTC CCCAGATGCC AGGAGGAGCA GATGGATAGG TAGCTTGTTG GTGAGCGCTC 113 40 
GAAAGAAAAT GGACGGGCCT GGGTGTTTGT TGTGCTGCAC TGAACCCTCC TCCTATCTTG 11400 
15 CACATTCCCG GTTGTTTTTG TACATATAAC TAATAATTGC CCGTGCGCTC AACGTGAAAA 11460 
TCC 11463 

(2) INFORMATION FOR SEQ ID NO: II: 
2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

35 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIONS. .2651 

(D) OTHER INFORM ATION:/producl= "nucleotide sequence of 
40 cDNA wheat SSS V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCTCCCACTC TTCTCTCCCC GCGCACACCG AGTCGGCACC GGCTCATCAC CCATCACCTC 60 

45 

GGCCTCGGCC ACCGGCAAAC CCCCCGATCC GCTTTTGCAG GCAGCGCACT AAAACCCCGG 12 0 

GGAGCGCGCC CCGCGGCAGC AGCAGCACCG CAGTGGGAGA GAGAGGCTTC GCCCCGGCCC 180 

50 GCACCGAGCG GGGCGATCCA CCGTCCGTGC GTCCGCACCT CCTCCGCCTC CTCCCCTGTC 240 

CCGCGCGCCC ACACCCATGG CGGCGACGGG CGTCGGCGCC GGGTGCCTCG CCCCCAGCGT 3 00 

CCGCCTGCGC GCCGATCCGG CGACGGCGGC CCGGGCGTCC GCCTGCGTCG TCCGCGCGCG 3 60 

55 

GCTCCGGCGC TTGGCGCGGG GCCGCTACGT TGCCGAGCTC AGCAGGGAGG GCCCCGCGGC 42 0 

GCGCCCCGCG CAGCAGCAGC AACTGGCCCC GCCGCTCGTG CCAGGCTTCC TCGCGCCGCC 4 80 

60 GCCGCCCGCG CCCGCCCAGT CGCCGGCCCC GACGCAGCCG CCCCTGCCGG ACGCCGGCGT 54 0 

GGGGGAACTC GCGCCCGACC TCCTGCTCGA AGGGATTGCT GAGGATTCCA TCGACAGCAT 600 
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AATTGTGGCT GCAAGTGAGC AGGATTCTGA GATCATGGAT GCGAATGAGC AACCTCAAGC 6 50 
TAAAGTTACA CGTAGCATCG TG TTTGTG AC TGGTGAAGCT GCTCCTTATG CAAAGTCAGG 72 0 
GGGGCTGGGA GATG TTTGTG GTTCGTTACC AATTGCTCTT GCTGCTCGTG GTCACCGTGT 730 
GATGGTTGTA ATGCCAAGAT ACTTGAATGG GTCCTCTGAT AAAAACTATG CAAAGGCATT 84 0 
10 ATACACTGGG AAGCACATTA AGATTCCATG CTTTGGGGGA TCACATGAAG TGACCTTTTT 900 
TCATGAGTAT AGAGACAACG TCGATTGGGT GTTTGTCGAT CATCCGTCAT ATC AT AG AC C 95 0 
AGGAAGTTTA TATGGAGATA ATTTTGGTGC TTTTGGTGAT AATCAGTTCA GATACACACT 102 0 
CCTTTGCTAT GCTGCATGCG AGGCCCCACT AATCCTTGAA TTGGGAGGAT ATATTTATGG 103 0 
ACAGAATTGC ATGTTTGTTG TGAACGATTG GCATGCCAGC CTTGTGCCAG TCCTTCTTGC 1140 
2 0 TGCAAAATAT AGACCATACG GTGTTTACAG AGATTCCCGC AGCACCCTTG TTATACATAA 1200 
TTTAGCACAT CAGGGTCTGG AGCCTGCAAG TACATATCCT GATCTGGGAT TGCCACCTGA 1250 
« 2 5 ATGGTATGGA GCTTTAGAAT GGGTATTTCC AGAATGGGCA AGGAGGCATG CCCTTGACAA 132 0 

,Q GGGTGAGGCA GTTAACTTTT TGAAAGGAGC AGTCGTGACA GCAGATCGAA TTGTGACCGT 1380 

q CAGTCAGGGT TATTCATGGG AGGTCACAAC TGCTGAAGGT GGACAGGGCC TCAATGAGCT 1440 

|S 30 CTTAAGCTCC CGAAAAAGTG TATTGAATGG AATTGTAAAT GGAATTGACA TTAATGATTG 1500 

%j GAACCCCACC ACAGACAAGT GTCTCCCTCA TCATTATTCT GTCGATGACC TCTCTGGAAA 1560 

GGCCAAATGT AAAGCTGAAT TGCAGAAGGA GCTGGGTTTA CCTGTAAGGG AGGATGTTCC 162 0 
TCTGATTGGC TTTATTGGAA GACTGGATTA CCAGAAAGGC ATTGATCTCA TTAAAATGGC 1680 
CATTCCAGAG CTCATGAGGG AGGACGTGCA GTTTGTCATG CTTGGATCTG GGGATCCAAT 174 0 
40 TTTTGAAGGC TGGATGAGAT CTACCGAGTC GAGTTACAAG GATAAATTCC GTGGATGGGT 180 0 
TGGATTTAGT GTTCCAGTTT CCCACAGAAT AACTGCAGGT TGCGATATAT TGTTAATGCC 186 0 
ATCCAGGTTT GAACCTTGTG GTCTTAATCA GCTATATGCT ATGCAATATG GTACAGTTCC 1920 
TGTAGTTCAT GGAACTGGGG GCCTCCGAGA CACAGTCGAG ACCTTCAACC CTTTTGGTGC 1980 
AAAAGGAGAG GAGGGTACAG GGTGGGCGTT CTCACCGCTA ACCGTGGACA AGATGTTGTG 2 04 0 
50 GGCATTGCGA ACCGCGATGT CGACATTCAG GGAGCACAAG CCGTCCTGGG AGGGGCTCAT 2100 
GAAGCGAGGC ATGACGAAAG ACCATACGTG GGACCATGCC GCCGAGCAGT ACGAGCAGAT 2160 
CTTCGAATGG GCCTTCGTGG ACCAACCCTA CGTCATGTAG ACGGGGACTG GGGAGGTCGA 2 220 
AGCGCGGGTC TCCTTGAGCT CTGAAGACAT GTTCCTCATC CTTCCGCGGC CCGGAAGGAT 2280 
ACCCCTGTAC ATTGCGTTGT CCTGCTACAG TAGAGTCGCA ATGCGCCTGC TTGCTTGGTC 2 3 40 
6 0 CGCCGGTTCG AGAGTAGATG ACGGCTGTGC TGCTGCGGCG GTGACAGCTT CGGGTGGATG 2 400 
ACAGTTACAG TTTTGGGGAA TAAGGAAGGG ATGTGCTGCA GGATGGTTAA CAGCAAAGCA 2 4 60 
CCACTCAGAT GGCAGCCTCT CTGTCCGTGT TACAGCTGAA ATCAGAAACC AACTGGTGAC 2 52 0 
TCTTTAGCCT TAGCGATTGT GAAGTTTGTT GCATTCTGTG TATGTTGTCT TGTCCTTAGC 2 580 
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55 



65 



WO 99/14314 



PCT/AU98/00743 



- 83 - 

TGACAAATAT TAGACCTGTT GG AG AATTTT ATTTATCTTT GCTGCTGTTG TTTTTGTTTT 2 64 
GTTAAAAAAA AAAAAAAAAA AA 2662 

5 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
1 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



15 



30 



45 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triticum tauschii 



(ix) FEATURE: 
2 0 (A) NAME/KEY: Protein 
(B) LOCATION: I ..768 

(ix) FEATURE: 
(A) NAME/KEY: Protein 
2 5 (B) LOCATION: 1 ..768 

(D) OTHER INFORMATION:/product= "deduced amino acid 
sequence SBE II" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Thr Phe Ala Val Ser Gly Ala Thr Leu Gly Val Ala Arg Pro 
15 10 15 



Pro Ala Ala Ala Gin Pro Glu Glu Leu Gin lie Pro Glu Asp lie Glu 
35 20 25 30 

Glu Gin Thr Ala Glu Val Asn Met Thr Gly Gly Thr Ala Glu Lys Leu 
35 40 45 

4 0 Glu Ser Ser Glu Pro Thr Gin Gly lie Val Glu Thr lie Thr Asp Gly 

50 55 60 



Val Thr Lys Gly Val Lys Glu Leu Val Val Gly Glu Lys Pro Arg Val 

65 70 75 80 

Val Pro Lys Pro Gly Asp Gly Gin Lys He Tyr Glu He Asp Pro Thr 
85 90 95 



Leu Lys Asp Phe Arg Ser His Leu Asp Tyr Arg Tyr Ser Glu Tyr Arg 

50 100 105 110 

Arg He Arg Ala Ala He Asp Gin His Glu Gly Gly Leu Glu Ala Phe 
115 120 125 

5 5 Ser Arg Gly Tyr Glu Lys Leu Gly Phe Thr Arg Ser Ala Glu Gly He 

130 135 140 



60 



Thr Tyr Arg Glu Trp Ala Pro Gly Ala His Ser Ala Ala Leu Val Gly 
145 150 155 160 
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Asp Phe Asn Asn Trp Asn Pro Asn Ala Asp Thr Met Thr Arg Asp Asp 
155 170 175 

Tyr Gly Val Trp Glu He Phe Leu Pro Asn Asn Ala Asp Gly Ser Pro 
180 185 190 

Ala He Pro His Gly Ser Arg Val Lys He Arg Met Asp Thr Pro Ser 
195 200 205 

Gly Val Lys Asp Ser He Ser Ala Trp He Lys Phe Ser Val Gin Ala 
210 215 220 

Pro Gly Glu He Pro Phe Asn Gly He Tyr Tyr Asp Pro Pro Glu Glu 
15 225 230 235 240 

Glu Lys Tyr Val Phe Gin His Pro Gin Pro Lys Arg Pro Glu Ser Leu 
245 250 255 

.Arg He Tyr Glu Ser His He Gly Met Ser Ser Pro Glu Pro Lys lie 
^ U 260 265 270 

Asn Ser Tyr Ala Asn Phe Arg Asp Glu Val Leu Pro Arg He Lys Arg 
275 280 285 

; fl 25 Leu Gly Tyr Asn Ala Val Gin He Met Ala He Gin Glu His Ser Tyr 

"= 295 



35 



40 



45 



50 



55 



300 



Tyr Ala Ser Phe Gly Tyr His Val Thr Asn Phe Phe Ala Pro Ser Ser 
30 305 310 315 320 

Arg Phe Gly Thr Pro Glu Asp Leu Lys Ser Leu He Asp Arg Ala His 
325 330 335 

Glu Leu Gly Leu Leu Val Leu Met Asp He Val His Ser His Ser Ser 
340 345 350 

Asn Asn Thr Leu Asp Gly Leu Asn Gly Phe Asp Gly Thr Asp Thr His 
355 360 365 

Tyr Phe His Gly Gly Pro Arg Gly His His Trp Met Trp Asp Ser Arg 
370 375 3g0 

Leu Phe Asn Tyr Gly Ser Trp Glu Val Leu Arg Phe Leu Leu Ser Asn 
385 3 90 395 4 00 

Ala Arg Trp Trp Leu Glu Glu Tyr Lys Phe Asp Gly Phe Arg Phe Asp 
405 410 4i 5 

Gly Val Thr Ser Met Met Tyr Thr His His Gly Leu Gin Met Thr Phe 
420 425 430 

Thr Gly Asn Tyr Gly Glu Tyr Phe Gly Phe Ala Thr Asp Val Asp Ala 
43 5 440 445 

Val Val Tyr Leu Met Leu Val Asn Asp Leu He His Gly Leu His Pro 
450 455 460 

Asp Ala Val Ser He Gly Glu Asp Val Ser Gly Met Pro Thr Phe Cys 
60 465 470 4 75 480 

He Pro Val Pro Asp Gly Gly Val Gly Phe Asp Tyr Arg Leu His Met 
48 5 490 495 
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Ala Val Ala Asp Lys Trp He Glu Leu Leu Lys Gin Ser Asp Glu Ser 

500 505 510 

Trp Lys Met Gly Asp He Val His Thr Leu Thr Asn Arg Arg Trp Leu 

5 515 520 525 

Glu Lys Cys Val Thr Tyr Ala Glu Ser His Asp Gin Ala Leu Val Gly 

530 535 540 

10 Asp Lys Thr He Ala Phe Trp Leu Met Asp Lys Asp Met Tyr Asp Phe 

545 550 555 560 



15 



30 



Met Ala Leu Asp Arg Pro Ser Thr Pro Arg He Asp Arg Gly He Ala 
565 570 575 

Leu His Lys Met He Arg Leu Val Thr Met Gly Leu Gly Gly Glu Gly 
580 585 590 



Tyr Leu Asn Phe Met Gly Asn Glu Phe Gly His Pro Glu Trp He Asp 
20 595 600 605 

Phe Pro Arg Gly Pro Gin Thr Leu Pro Thr Gly Lys Val Leu Pro Gly 

610 615 620 

2 5 Asn Asn Asn Ser Tyr Asp Lys Cys Arg Arg Arg Phe Asp Leu Gly Asp 

625 630 635 640 



Ala Asp Phe Leu Arg Tyr His Gly Met Gin Glu Phe Asp Gin Ala Met 
645 650 655 

Gin His Leu Glu Glu Lys Tyr Gly Phe Met Thr Ser Glu His Gin Tyr 
660 665 670 



Val Ser Arg Lys His Glu Glu Asp Lys Val He He Phe Glu Arg Gly 

35 675 680 685 

Asp Leu Val Phe Val Phe Asn Phe His Trp Ser Asn Ser Phe Phe Asp 

690 695 700 

40 Tyr Arg Val Gly Cys Ser Arg Pro Gly Lys Tyr Lys Val Ala Leu Asp 

705 710 715 720 



45 



Ser Asp Asp Ala Leu Phe Gly Gly Phe Ser Arg Leu Asp His Asp Val 

725 730 735 

Asp Tyr Phe Thr Thr Glu His Pro His Asp Asn Arg Pro Arg Ser Phe 

740 745 750 



Ser Val Tyr Thr Pro Ser Arg Thr Ala Val Val Tyr Ala Leu Thr Glu 
50 755 760 765 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 10550 base pairs 
5 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: iriticum lauschii 

(ix) FEATURE: 
5 (A) NAME/KEY: exon 

(B) LOCATION: I.. 3 1 6 

(D) OTHER INFORMATION:/product= "exon I " 

(ix) FEATURE: 
1 0 (A) NAME/KEY: exon 

(B) LOCATION: 1472.. 1 828 

(D) OTHER INFORMATION :/product= exon 2" 

(ix) FEATURE: 
1 5 (A) NAME/KEY: exon 

(B) LOCATION:2766..2823 

(D) OTHER INFORMATION:/product= "exon 3" 

(ix) FEATURE: 

2 0 (A) NAME/KEY: exon 

(B) LOCATION:2906..3028 

(D) OTHER INFORMATION:/producr= "exon 4" 

(ix) FEATURE: 
25 (A) NAME/KEY: exon 

(B) LOCATION* 1 13..4194 

(D) OTHER INFORMATION:/product= "exon 5" 

(ix) FEATURE: 

3 0 (A) NAME/KEY: exon 

(B) LOCATION:4286..4459 

(D) OTHER INFORMATION:/produci= "exon 6" 

(ix) FEATURE: 

3 5 (A) NAME/KEY: exon 

(B) LOCA*TION:4562..4643 

(D) OTHER INFORMATION7product= "exon 7" 

(ix) FEATURE: 

4 0 (A) NAME/KEY: exon 

(B) LOCATION:4744..4855 

(D) OTHER INFORMATION:/product= "exon 8" 

(ix) FEATURE: 

4 5 (A) NAME/KEY: exon 

(B) LOCATION:4999..502I 

(D) OTHER INFORMATION:/product= "exon 9" 

(ix) FEATURE: 

5 0 (A) NAME/KEY: exon 

(B) LOCATIONS 102..5 1 92 

(D) OTHER INFORM ATION:/produci= "exon 10" 

(ix) FEATURE: 
5 5 (A) NAME/KEY: exon 

(B) LOCATION:8593..87l8 
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(D) OTHER INFORMATION:/product= "exon 11" 






5 


(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION:8807..89I5 

(D) OTHER INFORMATION:/product= exon 12" 






10 


(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION:8992..9104 

(D) OTHER INFORMATION:/product= "exon 13" 






15 


(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION;9I61..9199 

(D) OTHER INFORM ATION:/produci= "exon 14" 






20 


(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION:9498..9713 

(D) OTHER INFORMATION :/product= "exon 15" 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 




S .TSST 


25 


ATGGCGGCGA CGGGCGTCGG CGCCGGGTGC CTCGGGCCCA GCGTCCGCCT 


50 


fjj 




GCGCGCCGAT CCGGCGACGG CGGCCCGGGC GTCCGCTTGC GTCGTCCGCG 


100 




30 


CGCGGCTCCG GCGCTTGGCG CGGGGCCGCT ACGTCGCCGA GCTCAGCAGG 


150 




GAGGGCCCCG CGGCGCGCCC CGCGCAGCAG CAGCAACTGG CCCCGCCGCT 


200 






CGTGCCAGGC TTCCTCGCGC CGCCGCCGCC CGCGCCCGCC CAGTCGCCGG 


250 


!=% 


35 


CCCCGACGCA GCCGCCCCTG CCGGACGCCG GCGTGGGGGA ACTCGCGCCC 


300 






GACCTCCTGC TCGAAGGTAA AAAACA AGGC TGAATCCTCA GATCACTCCG 


350 




40 


CGTCTTCGTT TTACCAAATA CGGTACTGCG AAGTGGTGCT GTATATGTG A 


400 




AGTTTCTGTC G ATTTCTTCC TGACGGATGT TCAGTCG ATT CAGTTGTATA 
TATGTG AT AC GTTCGTTGTT CATCGATCGT AC A G ATTT AC CAGCACACTA 


450 
500 




45 


GATAGAAATC GAGACCGACG CGGGCAGATC AATAGATTTT TCTAGACGTT 
TTATTGGATC GTG AG ATGAT TGATTGGGGT GGCGTGTCGA TACGATAGCG 


550 
600 




en 


GTGCACCGCC GATGTATCGG GGCATGTGCA CGTGGTTGGG TCTC AGC AG A 


650 




CATATCACTA GACTGGTATC GT A ATTT A CT AGTACTACTG GAAAGAGGAC 
TAAA AAGGCT AGGCCAAGTG CACGCATGTT GGG A ACGTTG TTAAATTGAT 


700 
750 




55 


GAGTTTGTCC TTTGCTTGGG CTGGTATTAT TACCAAAAAA TGGTGTTAGT 


800 
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CCCTGTAC IT ArrAATnnPiA a a atptta at ATrATATTpr rrTTTATpAr 


850 






TCTCC A A I TG TATATTPTP A n.n a ptp a a pt n. a 'itit a rTr ata otot a ot 

4 1 IVJ v-/\ uLAL I LAAL 1 OA i 1 1 I AL lu AJ AC I GTAGT 


900 




5 


GGAAATGACA CGTGAGPAPP nnnm^rn A AT. HA ATrfA a xr OTT/ -i-itoto 
x -" l,-ri v-vj 1 \-ii-ivj\„rt^v^ t ttt 1 1 t AAU U AA i ULAA lu Li 1 C 1 i 1 C i C 


950 






IT IT ATA IT A CAGG AAPTAP AAf.r.AnrTTr rArrmrrAr t a o a o a a ot a 

«. 1 * * * * * vnvjunAv^ 1 rvvj /■vrYUUAUt 1 it t A tt ill UAu 1 AG AG A AG 1 A 


1000 




10 


CTCCCTCCGT TCP A A A ATA P A TP. A rTr A A r TTTP.T A rT A a ttttpt a ot a 
* 1 1 *-v,.rv/vrt./\ 1 rt.u /\ 1 O AC 1 LAAL 1 1 lui AG 1 A A 1111 GTACTA 


1050 




TAG TTAGTAC AAAPTTP A PT rATrTA'rrrr An a Arrr a 00 0 a ot a ot a t/** 
* 1 ^ * 1 1 1 t/\ iliai 1 1 1 AuAALUuAUu GAG i AGTATC 


1 100 






GAAATTGAAG ACCCTTGTAT TArTr,Tf > 'l'i'r. 'ititita a Tr a a a atooo a 0 
* ^ iVI 1 ivJiA 1 1 /\t luJti 1 u liiii LAA 1 G AAAA 1 GGGAG 


1 150 




15 


GCCCATGCAG TAAGTCAPAT ncicm Acrrnr: n a mcrcc^r* a to a TrTrT^r 
v^v^v^^r*. * \_iv^i-kv_i inAuiv-AtAi uuULALL 1 uu GAGGG 1 GGGA ICAlGTGTGt 


1200 






TTTGCAGAGT ACTAPAPPPA nrTf a rrrrr thtt a r a tt a ottottoooo 

* vjv,nvj^wvj x a j~wv_i^a.v_\_v_/\ ot 1 LALLL it lull AuA 1 1 A G 1 I GTTGGGC 


1250 




20 


TGCTTAC ITTG Tfl I nnvnT HPa^tatatp a p a r* a Trrn*/^ a a Trr^*-^ * -r* 
i vjv- i rtv, luiii ut lul ULAU 1 A 1 A 1 G AGAGA 1 GGT G A A 1 1 1GGCAT 


1300 




CTAGCTGAGA ACAHA ATPP a r.r.TmrArrA ttott a tt a t Tr 1 /-^ a a aoto 

* 1 wrivjn nv,ftvj/\A 1 UtA UU I I ULALLA 11G11A11A1 1 GG 1 AAACTG 


1350 






TTGTC ACGC A AVITA TA a a p a a Trvrn a tpt tptp a ot a tt a a tt* a a t/-» a t 

i * ^ i v.nv.vjv.n ni 1 l /\ l AAAu AA 1U1 UA 1L1 1 G 1 GAG 1 A I 1 AATTAATCAT 


1400 


Q 


25 


GTTCTGCTA A T A TPTP.TPPnr nnnTn^rnr"rr % tto a o a a ata t a oo a a * 
U1 IV1 ^ 1 lAiLiui tt l tot 1 1 1 GG lu 1 1 G AG A AATA TACC ATATG A 


1450 






ATA i i l l c '( a T I * r i PiP a a en a nnn a TTrrr o a oo a TTpr« a too a a-* a * 
nmu 1 A *~c\ iii lutAAtt AuOuAl 1GG1 GAGGA1 I GG A TCG ACAGC AT 


1500 




30 


AATCGTGGPT PP a a PTP a nr^ a nr\ a TTrTr 1 a o a to a too a t r^r^r^ a a t~s> a 
rwn. i i uuu i ulaau i uAUL AGG A 1 1 G 1 OA GA 1 GATGGAT GCG AATGAGC 


1550 




AACCTCAAGP TAA AGTTAPA PP.TAPPATrr TrTTTrTr a o too to a a oot 
1 i nAj ^^ 1 1 At a to 1 AULA 1 tG 1 G 1 1 1 G 1 GAG 1 GGTGAAGCT 


1600 


;_3 1 
In 




GCTCCTTATG PA A A P.TPA p.n nnnnn^rnnr^ a r* a to ttto to ottoott a oo 
u ^ 1 ^ w ^ t/\/\/\u i LAUU UuLLfL 1 GGGA GA 1G1 1 1GIG G I TCGTTACC 


1650 


!^ 


35 


AATTGCTCTT Pi PTP PTP P. TP P.TPArrrTrT o a toottot a a tooo a a /- a t 
i i vjc i uc i tu l u u i LALLU 1 G 1 GA 1 GG 1 1 G 1 A ATGCCAAGAT 


1700 






APT IP A ATPP PiTPPTPTP at a a a a a ot a to o a a a ooo a tt a t a a n^/io/^ 
rw^ i i i *au kj i tt 1 1 1 UA i AAAAAt 1 A 1 G GAAAGGGA 1 1 ATACACTGCG 


1750 




40 


A APP A P ATTA APATTPPATr* r'ri'iTrrpp a toaoatoa a a r>/->m»nm» 
rLn.vj^t\^t\ i i a aua i i tt A 1 G G 1 1 1 GGGGG A 1 GAG ATGAAG TG ACC I i 111 


1800 




TCATP APTAT APAPAPa app. ^rnr^, a ttooot ooota o a o a a to a ^ / ->p l . / -K r > r 
i v-i-A i vj^wj i ai nuAUAtAALU 1 tUA 1 1 GGG 1 GGG 1 AGAG AA Tt ACC 11 G 1 1 


1850 






ATTCTCTGTT PA ATTPTAPP A A PTP.' i-itaT ootto im-j' a o aottotttta 
a i vjrtAv i i u i aul AAt 1 u 1 1 1 A 1 tt 1 1 G 1 I 1 AC AG 11 C 111 1 A 


1900 




45 


GCCCTGCAAA GAPATATPTP A tttpp a t a n 1 1 * i * i ' r i • i ttt a tttooottot 
^ * v-^wi-n-vin. \jy^v-rv l A i u 1 vj /\ 1 1 1 tt A 1 At llllllOilA 111 CCtTTGT 


1950 






A CTC 11 GPTP ATPAAPPTPa a a a t a Tf^ ata t a too a too a a ot/-» a tv» /-^ a 

1 1 rt 1 wauu 1 1/\ aaa i A 1 1 A 1 A 1 A 1 tt A 1 GG A AG 1 1 ATGCAT 


2000 




50 


GTGCCTAGTA n ri "i pptp.t pp.r.Tr.rrTTT a aotttoa oo o atta at*/-*^ 
1 1 rt liiii uu i u l tuu 1 Gtt 111 AAt ill CAGG G A 1 TAATACG 


2050 




TGGAATTTGA TAACTAAAGT TTA i 1 1 1 ATT GAAAAAAATT GTAGGTTGG 


2100 






TGAGCCCACA GCCACGCAGT GGCACCACTG CTTGCACATG ATTTTGCATT 


2150 




55 


TCTGTTTGCA CCG AGCACTT CATGTGA ATA AGGTGTAAAA TCATAAAGTA 


2200 



- 89 - 

CCAATTTTAT TCTGCCAATT GCACTTAAG A GTATATACAT TTATCTTG G C 
CTCAATCATG GG AGTACTGT GC ATTCAGTG CACCATCATT GTTCTA A G G A 
GAAAATGTGG GTGCAAGGAA GACACTTTTG TCCCTTAATA AAAGGCAGGC 
ACTCTGTTGT CATATAGATA GAAAGCAACA AACTTATTTC AAAGAGCTAA 
CAATGGCAAA AGAACCAAAA AAAGCATGCT AAGGCGGTGA CACCAAAAGG 
TGAGGGGGGC CTTGTGACTG ACAGCACCCC AAA CTATTG C CATTGTTTTA 
CTAAATGAAG ATCATTTTAG AAGCTCTCAG GAACTTCGAA AACAGTGGCT 
TTCCGTCCAC AGATCGTCTG TT A AT ATTTT TGTCCAGTG A TACTTTTTTT 
GCTCCTTACA AGAGTGCCTA TGTTGACATA TACATTGTTA AGTTGTTCAT 
AAGTTTACTT CTTATTCTAA ACAGCAAGTG CCTAATGCTT GCATTTATTT 
TGGCTATTTA TTTTTATTCT CATTTCAATC AACACTTTTG TTCAGGTGTT 
TGTCGATCAT CCGTCATATC ATAGACCAGG AAGTTTATAT GG AGATAATT 
TTGGTGCTTT TGGTGATAAT CAGGTACACT ACACTATACT AAGCTCCTAG 
TTGACTAAGT CGTA AGTTGT ACCTCCTCGC TGACCGGCTG CTCTATGTCG 
TGCAGTTCAG ATACACACTC CTTTGCTATG CTGCATGCGA GGCCCCACTA 
ATCCTTGAAT TGGGAGGATA TATTTATGGA CAGAATTGCA TGTTTGTTGT 
GAACGATTGG CATGCCAGCC TTGTGCCAGT GTACGTTGTT TGTGGATCTG 
AAAGTCCAAT CCTTTATTCA TTCTCTGCTT TGCAGTGTGC CCATGTCTAC 
ATTTCTTTTA TGCTTTTTTC ATGTCTGTTC TTATATTGCA T ATATG CTT A 
TGGAGTCTAA AAGTTACCGG AGGGAATAAC TCTTAAGG AT TTCCTCAATC 
AATTATCTTT AGCTTTAGTT AACATTTACT GTGGCAAACA TAATGTGTTT 
TGAGATTTAC AAGTTCAGAG ATTGCACTTC ACTAGTTCGT AGCTAATCTG 
ATGTTTTCCC CGAGAAAATG CCTAAAGCTT TGTGTCTTG A TGCATTGATA 
GAAA AAGAGT TTATGTACAC TCCCAAAGAG GGGACCCAAA ATTACAACAC 
CACACCCCTG AGAACTAGGC GCTGCCGGAA GAAGCGATGC AAGCCCCACT 
GCCCCTGCCT TAGCTCAAAG CCGGGCGTCA GCTTGATTGT GTCAAGTAAG 
CTAGCAGTGC TAGATTGCGC AAGGTCGATT CGTCGAAGAT GACAGTGTTG 
CGCTGCTTCC AAATCCACCA A A CTATG A GC ATGATCACTG GAGAAGTACC 
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TTTTCTCGCG GCTGAGGGGG TGGACTGGTG GTCTGCTGCT GCCAGTTTTC 3650 
AGATAATCTG AAAAATGCAT GTTTTGATGA TTTTAGTATC TTGCGGACCC 3700 
5 TGGGTACCAC CTAAGCTTTC ACACAGTAAT TTGCAGTTAC ACCTATAAAA 3750 
GTAACGGTCA TGATATGCAT GTGTTTTGGG TAGATCATGG TGCATGCATT 



10 



3800 



TTAGGAATTA GGACATGCCA GAACCACGTG AGGCTTATGG GGCAATTCAT 3850 
TTGTTCCATT ATACGAGTCA TGAATATGGT TCAGCATGTT TGGACGCTAC 



3900 



4000 
4050 



TTGTTTGGGG CAATTTCAGA TGGTGAATTG TAGCTGCTTG ATGTTGGCTA 3950 
1 5 GCTGGCTTAT TTTGTACAAG TATCGATGTT AGATGCATAT TTCCTTTTGT 

TCTTGTGCTG TTTGCCATGT TGTATTCCCC TTTTCTGTCG CCAGTGTTGC 

^ ATGTTAAATT GGTTTTCATT ACATAATCAA CTTTGTTGCT GACATCAGTC 4100 

ATTTTTATTC AGCCTTCTTG CTGCAAA ATA TAGACC ATAC GGTGTTTACA 4150 

GAGATTCCCG CAGCACCCTT GTTATACATA ATTTAGCACA TCAGGTTTGG 4200 

25 GTCTATCACC TTTCATTATC CGTACATGGC TTTGTAAGTC GGTTCACACG 4250 

TATCGTCATA CTGTATGTTA TTTCAATGTC ATTAGGGTGT GGAGCCTGCA 4300 

3 0 AGTACATATC CTG ATCTGGG ATTGCCACCT GAATGGTATG GAGCTTTAG A 4350 

ATGGGTATTT CCAGAATGGG CAAGGAGGCA TGCCCTTGAC AAGGGTGAGG 4400 

CAGTTAACTT TTTGAAAGGA GCAGTTGTGA CAGCAGATCG AATTGTGACC 4450 

3 5 GTCAGTCAGG TG AAATACTC A ATA CTTCTC 11 11 11C11 T GCGGGATGTT 4 500 

CTTCAGTTCA ATTGCCCTGT CTTTCACCCA ATTAAGAAAT G ATTTAATCT 4550 

TTTGTTTCTA GGGTTATTCA TGGGAGGTCA CAACTGCTGA AGGTGGACAG 4600 

40 

GGCCTCAATG AGCTCTTAAG CTCCCGAAAA AGTGTATTGA ATGGTAACTA 4650 

TATTTGAATC CACTTATCTT CTTCTGAAAC ATATTTACAG AAATAGATGG 4700 

4 5 ATGGGTTGCA AGAATAAATT CAGTTTGCTC TTTCGGTATG AAGGAATTGT 4750 

AAATGGAATT GACATTAATG ATTGGAACCC CACCACAGAC AAGTGTCTCC 4800 

CTCATCATTA TTCTGTCG AT G ACCTCTCTG G A A AGGTGTG TGGATAGTAC 4 850 

CCTATATAAT AACATGTATA TCTGATCTAG TACTTTCTTT TTCTTTGCTA 4900 

GTTTGCTTCC C ATG ATGTTC TCACTAACTA ATCCTATGTG GTTTGGCATA 4950 

5 5 CTTGTCAGGC CAAATGTA AA GCTGAATTGC AGAAGGAGCT GGGTTTACCT 5000 
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GTAAGGGAGG ATGTTCCTCT GGTTAGATAC AAACCCCTAA GATATATATT 5050 

TTTTA AATCC CTAA A A A AAA CTTGCCG ATC ATCTCATTAG CTTGATTCAC 5 1 00 

5 AG ATTGGCTT TATTGGAAGA CTGGATTACC AG AAAGGCAT TG ATCTC ATT 5 1 50 

AAAATGGCCA TTCC A GAG CT CATGAGGGAG GACGTGCAGT TTGTAAGTTC 5200 

ATATTCTTTT TCTTG AGACT AGAGTATAAA TCAAACATGT AGGTGTGGGG 5250 

10 

TGGTATAATA C AG AC ATA AG TTCCAGCTAT TGCTTCCATG AGAATTTTAA 5300 

TGCTATTCAG TAATATGCTA CTGCAAGTTT TGAAACAAAG TTGG AAGCAA 5350 

1 5 TAAATATATG TGTAGCACTG ACCATGCAGT GCCACTATAG CTGG AATGTC 5400 

CTGTAGTCTA TGTGATCTAA CACACTCAAC AACATGTTTT CGCATACAAA 5450 

CACATGCGTG CGCGCAACAA ACATACTCTA CAATAAAATT GGCTTGGTGA 5500 

20 

ACTGCAGACA TGCTCTTATC TCC ATTCC A A CATTTCTTGT TTCAACATTG 5550 

GCTGAAGACT AAG AGAAGGG GGACCCAGGG TGATGTAGCC AACTAGATCC 5600 

2 5 AGTAAGGAAG CTAGCCGAGC CTAGG AGGAT TCGCTTAGGT AGCTGGAACG 5650 

TAGGGTCTCT GACAGGGAAG CTTCGGGAGC TAGTCGATGC AGTGGTGAGG 5700 

AGAGGTGTTG ATATCCTTTG CGTCCAAGAA ACCAAATGTA GGGGACAGAA 5750 

30 

GGCGAAGGAG GTGGAGGATA CCGGCTTCAA GCTGTGGTAC ATGGGACGGC 5800 

TGCAAACAGA AATGGCGTAG GCATCTTGAT CAACAAGAGC CTTAAGTATG . 5850 

3 5 GAGTGGTAGA CGTCAAGAGA CGTGGGGACC GCATTATCCT CGTCAAGCTG 5900 

GTAGTTGGGG ACTTAGTTCT CAATGTTATC AGCGTGTATG CCCCGCAAGT 5950 

AGGCCACAAT GAGAACGCCA AGAGGGAGTT CTGGGAAGGC CTGGAAGACA 6000 

40 

TGGTTAGGAG TGTACCGATT GGCGAGAAGC TCTTCATAGG AGGAGACCTC 6050 

AATGGCCACG TGGGTACATC TAACATAGGT TTTG A AGGGG C AC ATGGGGG 6 1 00 

4 5 CTTTGGCTAT GGC ATC A AG A ATCA AG A AG A AG ATGTCTTA CGCTTTGCTC 6 1 50 

TAGCCTACGA CATGATTGTA GCTAACACCC TCTTTAGAAA GAGAGAATCA 6200 

CATCTGGTGA CTTTTAGTAG TGGCCAACAC TAGCCAGATC GATTTCATCC 6250 

50 

TCTCGAGAAG AGA AGATAGG TGTGCGCGCC TAGACTGCAA GGTGATACCT 6300 

TCGGATTCGT GTCCAGCGGG ATAAGCGTGC CAAAGTCGCT AG AATGAAGT 6350 

5 5 GGTGGAAGCT CA AGGGGG AG CTAGCTCAGG CGTTC AAGG A G AGGGTCATT 6400 
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AGGGAGGGCC CTTGGGAGGA AGGAGGGG AT GCGGACAATG TGTGGATGAA 6450 
GATGGCGACT TGCATTCGTA AGGTGGCCTC GGAGGAGTGT G G A GTGTCC A 6500 
5 GGGGATGGAG AAGCGAAGAT AAGGATACCT GGTGGTGGAA TGATGATGTC 7000 
CAGAAGGCAA TTAAAGAGAA GAAAGATTGC TTTAGACGCC TATACTTGGA 7050 
1 0 TAGG AGTGCA GTCAACATAG A AAAGTAC AA GATGGCGA AG A AGGCCGCA A 7 1 00 

AGCG AGCTGT CAGTGAAGCA AGGGGTCGGG C AT ATG AG G A TCTCTACCA A 7150 

CGGTTAGGCA CGAAGGAAGG CGAAAGGGAC ATCTATAAGA TGGCCAAGAT 7200 
CCGAGAGAGA GGAAGACGAG GG ATATTGGC CAAGTCAAAT GCATCAAGG A 7250 

1 5 TGGAGCAG AC CAACTCTTGG TGAAGGACGA GGAG ATTAAG CATAGATGGC 7300 

GGGAGTACTT CGACAAGCTG TTCAATGGGG AGGATGAGAG TCCTACCATT 7350 

GAACTTGACG ACTCCTTTGA TGAGACCATC ATGCGTTTTA TGCGGCGAAT 7400 

CCAGGAGTCC GAGGTCAAGG AGGCTTTAAA AAGGAGGCAA GGCGATGGGC 7450 

CCTGATTGTA TCCCCATTGA GGTGTGGAAA GGCCTCGGGG ACATAGCGAT 7500 

2 0 AGTATGGCTA ACCAAGCTAT TCAACCTCAT TTTTCGGGCA AACAAGATGC 7550 

CAGAAGAATG GAGACGAAGT ATATTAGTAC CAATCATCAA ACAGGGGGGA 7600 

TGTTCAGAGT TGTACTAATT ACCATGGAAT TAAGCTGATG AGCCATACAA 7650 

TGAAGCTATG GGAGAGAATC ATTGAGCACC GCTTAAGAAG AATGACAAGC 7700 

GTGACCAAAA ATCAGTTTGG TTTCATGCCT GGGAGGTCGA CCATGGAAAC 7750 

25 CATTTTCTTG GTACGACAAC TTATGGAGAG ATACAGGGAG CAAAAGAAGG 7800 

ACTTGCATAT GGTGTTCATT GACTTGAAGA AG G CCTAT A A TAAGATACCG 7850 

CGGAATGTCA TGTGGTGGGC CTTGGAGAAA CACAAAGTCC CAGCAAAGTA 7900 

CATTACCCTC ATCAAGGACA TGTACGATAA TGTTGTGACA AGTGTTCGAA 7950 

CAAGTGATGT CGACACTAAT GACTTCCCGA TTAAGATAGG ACTGCATCAG 8000 

3 0 GGGTCAGCTT TGAGCCCTTA TCTTTTTGCC TTGGTGATGG ATGAGGTCAC 8050 

AAGGGATATA CAAGGAGATA TCCCATGGTG TATGCTCTTT GTGGATGATT 8 1 00 

TGGTGCTAGT TGACGATAGT CGGGCGGGGG TAAATAACAA GTTAG AGTTA 8 1 50 

TGGAGACAAA CCTTGGAATC GAAAGGGTTT AGGCTTAGTA GAACTAAAAC 8200 

CGAGTACATG ATGTGCGGTT TCAGTACTAC TAGGTGTGAG GAGGAGGAGG 8250 
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TTAGCCTTGA TGGGCAGGTG GTACCCCAG A AGGACACCTT TCG ATATTTG 8300 

GGGTCAATGC TGCAGGAGGA TGGGGGTATT G ATG A AG ATG TGAACCATCG 8350 

AATCAAAGCT GGATGGATGA AGTGGCGCCA AGCTTCTGGC ATTCTTTGTG 8400 

ACAAGAGAGT GCCACAAAAG CTAAGGCAAG TTCTACAGGA CGGCGGTTCG 8450 

5 ACCCGCAATG TTGTATGGCG CTGAGTGTTG GCCGACTAAA AGGCGACATG 8500 

TTCAACAGTT AGGTGTGGCG GAGATGCGTA TGTTGAGATG GATGTGTGGC 8550 

CACACGAGGA AGRATCGAGT CCGGAATGAT G ATATACGAG ATAGAGTTGG 8600 

GGTAGCACCA ATTGAAGAGA AGCTTGTCCA ACATCGTCTG AGATGGTTTG 8650 

GGCATATTCA GCGCACGCCT CCGAAAACTC CAGTGCATAA CGGACGGCTA 8700 

1 0 AAGCGTGCGG AGAATGTCAA G AGAGGGCGG GGTAGACCG A ATTTG AC ATG 8750 

GGAGG AGTCC GTTAAGAGAG ACCTG AAGGT TTGGAGTATT ACGAAAGAAC 8800 

TAGCTATGGA CARGGGTGCG TGGAAGCTTG TTATCCATGT GCCAGAGCCA 8850 

TGAGTTGATC ACGAGATCTT ATGGGTTTCA CCTCTAGCCT ACCCCAACTT 8900 

GTTTGGGACT AAAGGCTTTG TTGTTGTTGT TGTTGTTGTT GTTGTAGCCA 8950 

1 5 ACTAA ATCCA GTTGATCAGT GGTTTTTACT CTTATTTTTA C AGGTCATGC 9000 

TTGGATCTGG GGATCCAATT TTTGAAGGCT GGATGAGATC TACCG AGTCG 9050 

AGTTACAAGG ATAAATTCCG TGG ATGGGTT GGATTTAGTG TTCC AGTTTC 9 1 00 

CCA C AG A ATA ACTGCAGGGT ATGCCG AG AA CTTCTTA AC A AG ACCTTCGT 9150 

TATCAGCTTG GATATATTAT AATGTTCAAA ACATTTATGT CTCTCTTTTT 9200 

2 0 GTGCAGTTGC G ATATATTGT TAATGCCATC CAGGTTTGAA CCTTGTGGTC 9250 

TTAATCAGCT ATATGCTATG CAATATGGTA CAGTTCCTGT AGTTCATGGA 9300 

ACTGGGGGCC TCCGAGTAAG ACA ACTGCCT TGAAAATTAT CGTTATCTTG 9350 

GCTCCAACGC AAATGTTCTA ATTGGCTCGT GTATTCAACA GGAC ACAGTC 9400 

GAGACCTTCA ACCCTTTTGG TGCAAAAGGA GAGGAGGGTA CAGGGTACGC 9450 

2 5 ACTGCTCAAT TTTAGCTAAC TTTC AGTTTA TCTTTTTGCA ATGTCTTGGG 9500 

GGTTCATTGC GCCATAA ATC A ACTTGTGAT AATTAACTGT T A CTGTTCTG 9550 

TACTTGCAGG TGGGCGTTCT CACCGCTAAC CGTGGACAAG ATGTTGTGGG 9600 

TAAG lTJTl G CTG AGCTCTT GTCCGGTTAT AGGATCGACC TTGGCTGTAG 9650 
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CATGGTACCT TAGTGCCCCT TGTATATAG A CCTAACCTGA TGGACTCACT 9700 
TTGTCTACAC TAATCATAGT AGTCGATTGC CCGGAGGCGT TTTGCTTGGA 9750 
TTCTGCTAAT TTAATTTTCA TG ACGATAAC TCATACCATG GTTTGGTTCT 9800 
CCGATGGGGG CCAGAATGGC GTCTAGTGTC TGCGATCTGT GTAACTAGCC 9850 



30 



AATGCCGGGT TGTTCCA AGT GAAAATTTAC CTTTTGACCA TTGTGCAGGC 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



9900 



ATTGCGAACC GCG ATGTCG A CATTCAGGGA GCACAAGCCG TCCTGGGAGG 9950 



10000 



10150 
10200 



10350 
10400 



GGCTCATGAA GCGAGGCATG ACGAAAGACC ATACGTGGGA CCATGCCGCC 

GAGCAGTACG AGCAGATCTT CGAATGGGCC TTCGTGGACC AACCCTACGT 10050 

C ATGTAG ACG GGGACTGGGG AGGTCG AAGC GCGGGTCTCC TTG AGCTCTG 1 0 1 00 

1 0 AAGACATGTT CCTCATCCTT CCGCGGCCCG GAAGGATACC CCTGTACATT 

GCGTTGTCCT GCTACAGTAG AGTCGCAATG CGCCTGCTTG CTTGGTCCGC 

CGGTTCGAGA GTAG ATGACG GCTGTGCTGC TGCGGCGGTG ACAGCTTCGG 1 0250 

GTGG ATGACA GTTACAGTTT TGGGG AATAA GGAAGGGATG TGCTGCAGGA 1 0300 

TGGTTAACAG CAAAGCACCA CTCAGATGGC A GCCTCTCTG TCCGTGTTAC 

1 5 AGCTGAAATC AG A AACCAAC TGGTG ACTCT TT A GCCTTAG CGATTGTGAA 

GTTTGTTGCA TTCTGTGTAT GTTGTCTTGT CCTTAGCTG A CAAATATTTG 1 0450 

ACCTGTTGG A TA ATTCTATC TTTGCTGCTG 1 1 1T11TH T GGTCA A A AG A 1 0500 

GGGGTTCCCT CCGATTTCAT TAACG AAACC A CCA A AATAA CAGCACCCAG 10550 

TGCAGGTCTC AGGTTCAGAT ATACTTAAGA CTACTAAATC TAACAGCAGC 10600 

2 0 TA AA A AGCTT AA AG ATTCAG GCG ACATAAC CG A AC A A A AT CCACA ACCG A 1 0650 

AGGGACCAAA GCAGGACAAG TAAAAAGGCA GNCGACACAA AGCGCAGGTC 10700 

GCTG A AAAGG CA AGC AGACA G AGGTCTGCA TTCTGTC A AC ACCACTTGTG 1 0750 

A A A AATG A AG AG AAG ATCG A G A ATTCCCGG G AATCCG 1 0787 

(2) INFORMATION FOR SEQ ID NO: 14: 
2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



30 



45 



60 
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(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: I. .647 

(D) OTHER INFORMATION:/product= "deduced amino acid 
sequence for SSS I" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Ala Ala Thr Gly Val Gly Ala Gly Cys Leu Ala Pro Ser Val Arg 
1 5 10 15 

Leu Arg Ala Asp Pro Ala Thr Ala Ala Arg Ala Ser Ala Cys Val Val 
20 25 30 

Arg Ala Arg Leu Arg Arg Leu Ala Arg Gly Arg Tyr Val Ala Glu Leu 
20 35 40 45 

Ser Arg Glu Gly Pro Ala Ala Arg Pro Ala Gin Gin Gin Gin Leu Ala 
50 55 60 

2 5 Pro Pro Leu Val Pro Gly Phe Leu Ala Pro Pro Pro Pro Ala Pro Ala 

65 70 75 80 



Gin Ser Pro Ala Pro Thr Gin Pro Pro Leu Pro Asp Ala Gly Val Gly 
85 90 95 

Glu Leu Ala Pro Asp Leu Leu Leu Glu Gly lie Ala Glu Asp Ser lie 
100 105 HO 



Asp Ser lie lie Val Ala Ala Ser Glu Gin Asp Ser Glu lie Met Aso 

35 115 120 125 

Ala Asn Glu Gin Pro Gin Ala Lys Val Thr Arg Ser lie Val Phe Val 

130 135 140 

4 0 Thr Gly Glu Ala Ala Pro Tyr Ala Lys Ser Gly Gly Leu Gly Asp Val 

145 150 155 160 



Cys Gly Ser Leu Pro lie Ala Leu Ala Ala Arg Gly His Arg Val Met 
165 170 175 

Val Val Met Pro Arg Tyr Leu Asn Gly Ser Ser Asp Lys Asn Tyr Ala 
180 185 190 



Lys Ala Leu Tyr Thr Gly Lys His lie Lys lie Pro Cys Phe Gly Gly 

50 195 200 205 

Ser His Glu Val Thr Phe Phe His Glu Tyr Arg Asp Asn Val Asp Trp 

210 215 220 

55 Val Phe Val Asp His Pro Ser Tyr His Arg Pro Gly Ser Leu Tyr Gly 

225 230 235 240 



Asp Asn Phe Gly Ala Phe Gly Asp Asn Gin Phe Arg Tyr Thr Leu Leu 
245 250 255 

Cys Tyr Ala Ala Cys Glu Ala Pro Leu lie Leu Glu Leu Gly Gly Tyr 
260 265 270 



4 




PCT/AU98/00743 



He Tyr Gly Gin Asn Cys Met Phe Val Val Asn Asp Trp His Ala Ser 
275 280 285 

Leu Val Pro Val Leu Leu Ala Ala Lys Tyr Arg Pro Tyr Gly Val Tvr 
290 295 300 

Arg Asp Ser Arg Ser Thr Leu Val He His Asn Leu Ala His Gin Gly 
305 310 315 320 

Leu Glu Pro Ala Ser Thr Tyr Pro Asp Leu Gly Leu Pro Pro Glu Tro 
325 330 335 " 

Tyr Gly Ala Leu Glu Trp Val Phe Pro Glu Trp Ala Arg Arg His Ala 
340 345 350 

Leu Asp Lys Gly Glu Ala Val Asn Phe Leu Lys Gly Ala Val Val Thr 
355 360 365 

Ala Asp Arg lie Val Thr Val Ser Gin Gly Tyr Ser Trp Glu Val Thr 
370 375 3 8 o 

Thr Ala Glu Gly Gly Gin Gly Leu Asn Glu Leu Leu Ser Ser Arg Lys 
385 390 395 40Q 

Ser Val Leu Asn Gly He Val Asn Gly He Asp He Asn Asp Trp Asn 
405 410 415 

Pro Thr Thr Asp Lys Cys Leu Pro His His Tyr Ser Val Asp Asp Leu 
420 425 430 

Ser Gly Lys Ala Lys Cys Lys Ala Glu Leu Gin Lys Glu Leu Gly Leu 
435 440 445 

Pro Val Arg Glu Asp Val Pro Leu He Gly Phe He Gly Arg Leu Asp 
450 455 46Q 

Tyr Gin Lys Gly He Asp Leu He Lys Met Ala He Pro Glu Leu Met 
465 470 475 480 

Arg Glu Asp Val Gin Phe Val Met Leu Gly Ser Gly Asp Pro He Phe 
485 490 495 

Glu Gly Trp Met Arg Ser Thr Glu Ser Ser Tyr Lys Asp Lys Phe Arg 
500 505 510 

Gly Trp Val Gly Phe Ser Val Pro Val Ser His Arg He Thr Ala Gly 
515 520 525 

Cys Asp He Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn 
530 535 540 

Gin Leu Tyr Ala Met Gin Tyr Gly Thr Val Pro Val Val His Gly Thr 
545 550 555 560 

Gly Gly Leu Arg Asp Thr Val Glu Thr Phe Asn Pro Phe Gly Ala Lys 
565 570 575 

Gly Glu Glu Gly Thr Gly Trp Ala Phe Ser Pro Leu Thr Val Asp Lys 
580 585 590 

Met Leu Trp Ala Leu Arg Thr Ala Met Ser Thr Phe Arg Glu His Lys 
595 6 oo 605 
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Pro Ser Trp Glu Gly Leu Met Lys Arg Gly Met Thr Lys Asp His Thr 
610 615 620 

5 Trp Asp His Ala Ala Glu Gin Tyr Glu Gin lie Phe Glu Trp Ala Phe 

625 630 635 640 

Val Asp Gin Pro Tyr Val Met 
645 

■10 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5072 base pairs 

(B) TYPE: nucleic acid 

1 5 (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

2 0 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: triiicum tauschii 
(F) TISSUE TYPE: Endosperm 
25 (ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATIONS. .4993 

(D) OTHER INFORMATION:/function= "region containing 
promoter of SSS I" 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 





TCT AG ATGC A 


TGCTGGATAG 


CGGTCGATGT 


GTGGAGTAAT 


AGTAGTAGAT 


GCAGAATCGT 


60 


35 


TTCGGTCTAC 


TTGTCGCGGA 


CGTGATGCCT 


ATATACATGA 


TCATACCTAG 


ATATTCTCAT 


120 




AACT ATGC TC 


AATTCTATCA 


ATTGCTCGAC 


AGTAATTCGT 


TTACCCACCG 


TAATACTTAT 


180 


40 


GATCTTGAGA 


GAAGTCACTA 


GTGAAACCTA 


TGCCCCCCAG 


GTCTATTTTG 


CATCATATTA 


240 


ATCTTCCAAT 


ACTTAGTTAT 


TTCCATTGCC 


GTTTATTTTA 


CTTTGTATCT 


TTATTTCTTT 


300 




TTATTATAAA 


AAATAC C AAA 


AATATTATCT 


TATCATATCT 


ATCAGATCTC 


ATTCTCGTAA 


360 


45 


GTGACCGTGA 


AGGGATTGAC 


AACCCCTTTA 


TCGTGTTGGT 


TGCGAGGTTC 


TTGTTTGTTT 


420 




GTGTAGGTGC 


GTGTGACTCG 


CACGTCTCCT 


ACTGGATTGA 


TACCTTGGGT 


TTTCAAAAAC 


480 


50 


TGAGAAAAAT 


ACTTACGCTA 


CTTTACTGCA 


TAACCCTTTC 


CTCTTTAAAA 


AAAAAAACCA 


540 


ACGTAGTATT 


CAAGAGGTAG 


CACGCTACCA 


TCCTCTCCAA 


CAGGAGCGCG 


GAGATCTTTG 


600 




TCCGGCAGGT 


TGATGCGGGC 


CGGGGAAGAA 


CTCCAGCTGC 


CTTGGCCAGC 


TTGGTCGTGA 


660 


55 


GCCGCCCCAG 


CGGCGTCTTG 


AACCTGTCCA 


CGTAGCGCTC 


CCTGACACGC 


GGCGTGAACT 


720 




GAGAAGGCTT 


GTCGATGAAC 


TCCAGCTGTT 


GTGCCAGCCT 


AGCTTGCGCC 


TTCTTCTGCT 


780 


60 


GGGTC ATGCC 


CTTCGAGAAA 


CCCACCTTGG 


CCACCCTTGT 


GCTTGAGCGG 


CGCGCCACCT 


840 


CAGCAGGCGG 


CGGCGTGGGG 


ATGAAGAGGG 


TGTCTGCTTC 


CGGAGCAGGC 


GGGTCGGCGT 


900 
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TGAACTTGAA AGGCGGTGGC CCCATGATGG ATGGGGGGAG CATGCCAAAG ACTTGG TTG A 96 0 
GGAAAGTGGT GTTGGCGTCC ACCTCCAGTG CCTGCAGTTT GGAAGCCAGA CGATTGGCGT 102 0 
5 CGATCTCTGG CTCCGGCTGG AAGGAGGCTC GACGCTCCGG TGTGCCAGAA CGCAAAGGGA 1080 
GGAGCGGCAG CTCTGGCTGA GCAGACCCCG CGCCCATGTA CTCTGCATTG GGCCAAGGCT 114 0 
io GCAGGGGCAA GCCACCGGGA TGGGGGCGCG AGGTGGACTG CGCACCGGAG GAAGGCCAAG 12 00 
CTCAACCTCG GTGAGGTTCG CCCCAGACCA GGGCGGCAGG CTCGGGTCCA CAAAGGGCCA 1260 
AACCGCCTCG TCCGCCCCGA AACTGTCCAG GACAGACGGC GGACGACGGA AGGCCGTGTC 13 20 
15 GTCG AGCTCG AGCAGCAGAG GGTCCGTGCG GGTGATGTCT TGCCAAATGG ACTCCACCTC 13 80 
CAGCAGGAAG GGGGACTGGT CCATCGCCCC TGGCCAAGCC ACTGGTACGC CAAAGATGGC 1440 
2q ATCAGCAGCG TTTGCACCAG GGGGAGCAGC CACACCTTGG AGGACAGGGA GGGTGCGGAC 1500 
:: GTCGACGGCA GCAAAACGTG GCTGGAGCAA GTTGCCGTCG CGTGCCGGCC TCGGCGAGCG 1560 
CGAGCGGCTG TAGGAGCGCT CGGTGCCCTC AGACTCGGAC AGTGCGCCAG TGGGAGAGCC 1620 
25 ATGGCGACGC CGGCCACCAC TGGACGTGCC ATGGCGCTGG TCCTGACGGC GCCTGGATGG 1680 
CCCGTCCTCG CGGGCAGCTC CACCTGAGCG GCACCCGAGG AGCACACCCC GCCAAGCTGG 1740 
3 0 GCCAGGGCGG CTGCGGCGAC GGCGACGGCC GCGGTCGCGG TCTGCACCAT CATCTTCATC 1800 
TTCGTCATCG TGGCGCCTCG GACAAGGATG CTCGCTGTCA CCGACGCGAG GGACGTGAGC 1860 
CGGCTCAGCC CGCCCTTCCT CGACGTGGCG AGCCCTGCGG ATATGCTCCT CGAGCGGCCA 1920 
35 TTGGGGGTCG TTGGCGCGCG GCATCTCGGG GTCGCGGTCA GCTATCGGGG TGTAGTCCTT 19 80 
TGTGGTGTCC AGG TGGATG A GCAGAGAGAA ATCCGGCCCC TCTAGCCCCT CGTCCCGGGG 2040 
4q GCAGCCCTCC GGCAGCGTCT GGCGGCCCCT GGGGTCCAGG GGTCGATCGA TGATGGAGAA 2100 
CCCCCTTTTG GTGGGGATGT CGTCCGGACT CCATGCCCAC ACCCAGGCAA AGAGGCAGGC 2160 
CGTGTTGGAG AGGGAGGTCG TCTGCCGCTC CAACCAGTCG ACGTGGCATG TCTTCCCGAG 2220 
45 CGCATCCTGC CCCGCCTCCT TGTTCCAGGA CTGCACCGGC ATGTTCTCGA CGGCGATGCG 2280 
.- GCAGTAGTAC CGCCAGACAC GGCGGTGGCC GTGTGCCGAT GGTGACCAGG CCGACAGGGA 23 4 0 
50 GAGCGCGACG CCCCAGCAGG AGACGACCCC AGCGTCGAAA GCGATGTCCC GGTGCCTGAA 24 00 
GTGGACGAGC CCAGAGATGG CCAGGCGCAT TGACGCGGGG AAGGGGAAGG AGTTAGGATG 24 60 
GGCGACGCGG CCGGAGTGAA CCGCGGCGTG GTGGCCGACG GGGCTGGAGA GGCAGAGGCG 2 52 0 
55 GAGTCATCCG AGAGAGGTGT ATCAGTGGCT CTGCACAATA CCCAGTGTCG CCACATCATA 2580 
TCCTGCTGAA TAACCACACA TGTGT AC TGT CGTTAAATAA ATCATTGGTC ACGCGAACCC 2640 
6o GGAAAAAGAC GGCGAAAAAT TCACGGACAC ACGACTAGTA GTACCCAATA TACTCGGCAA 27 00 
AAACAGTGAC ACGTCGTTTT GCGTTGTCGG CCGGTGTTGT CGAGTCATTG TACTATGTTT 27 60 
TGTCGTTTCT TTCTTTTCTC CAAATCGACA AACCGTTTGT CTTTGGTTAA AAAACAGAAA 2 82 0 
65 CATACAAAAT CAAATGAATG CATTCAAGGG CCGGTAATCC AATTCTGAGC CCAGGCTCAG 2 880 
CTACACCCGC CCTTACAAAA AAATCAAAAT AAATACTAGA AAAATTCAAA AAATTCCAAT 294 0 
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TTGTTTGTGC GTGGTAGATA ATTTGATGCG TGAGGTACGC TTC AATTTTC AAATTATTTG 3 000 
GACATCTGAG CAGCTCTCAG CAAAAAAGAC AAATTCGGGG TCTGTAAAAA TGTTTACTGT 3 0 60 
TCATGCACTG TTCTGACCCG ATTTGTCTTT TTTGCTGAGA GCTTCTCAGA AGTCCAAATG 3120 
AGCTAAAATT TTGAGCGGAG CTTACGTGAT AAAATGTCTA TCATGCAAAA AAGGATTGGA 3180 
10 ATTTTTTGAA TTTTTTTTAT TTTTTGTGAT TTGTTTC CTG GACGGGTGCA GATAAGCCTG 3 2 40 
GGCACCGAAA CGCCGCACTC AGGCTCATCC TTTTCTATAA AAGAAAAGAA ATACATACAA 33 00 
TTTCCCTCTG TTTTTTGAGC AAGGGGCACC ACCCACCAAA GAGTTTTCAA CTCACATGGT 3 3 60 

15 

ATT AG AGC AT CTACAGCCGG GCGTCTCAAA CCAGCCTCAT ACGCTTGAGC GGGTCGCCTT 342 0 
GGTCACGATT TTTTGACCCA GACGGGCCCC TCAAACGGTC CTTAAACGCC CAGGCTGACC 3 480 

2 0 GACAACCCAC ATATCCAGCC CAAATATGGG GTGGATATGG GGGCGCCCGG GCACGCCAGC 3 540 

CCGCGGACAC CACACATCTT CAGTTTCTAA TTTGAGATAT CCGGATGTGG AATGCGTTTT 3 600 
TGAGGGGTGA CCGGTCCCTG TCCGTGGATG CGCCCGGACG TTTGAGGGGT TGGATTTGCC 3 660 

25 

AAGTC TGATT AGAGATGCTC TTAGGTGTTC CACCCCCATC CCTTGATGGC TAGGGCAAAC 372 0 
TCTCCCCTCC AAACTTTGTC GGCGAGCCTG TGGATTCTTC TCTCCTCTGC CCGCTGCTCC 37 80 

3 0 GGCGGCTGAT GGCGGGGAGG AGAATCCCGG TGTCTTCGCT TGGTTAGTTG TTTAAGTTAC 3 840 

GTACTTTTTT AGTCCTCGCA GGTGCGGCGT TCGGACGTAT GGTCGTGCTT CTTTTTTGAG 3 900 
TTTGTCTTCC GGGCTCTGAT CCTCCTCGAG TTCGTC CATC TGGACGTACT CGACGGAGCT 3 960 
CCGGCATAGA TTCCTATCAT CGTCTTGGTG AGGTGAGGTT ATGGTTTCTT GTCATGTGGG 402 0 
C AG ATTTGG T GCCAGATGCT TCATATCTAT TCAAGGGTTC AGCGGCAACA ACTGCGGCTC 4080 

4 0 CAGAGCGATG GTC CTTAAGG GCACGTGCAC GAAGACTTCA CGGCTGTTAT CGACAAGGTC 414 0 

AAGCCGGCTC CGATAGGGGA GCAGCGACAG CGGCGCGTCA ACCGCTCGTT CTGGCGGCAG 4200 
TAGTGGTCGT TCGGTGCTCT CGGAACCTCG ATGTAATTTT TATGATTTTA GAGATGCTTT 4260 

45 

GTACTTCCGA TCGATGAACT CTGATAATAG ATATCTCTTC TCTCGCAAAA AAAGAGAGTT 43 2 0 
TTCAACTGAA AACAAAAGAG TTTCACTAGT TCTTCTTTTA GAAACAGAGT TTCACTAGCA 43 80 
50 CTTTTTTTTG CGAGAAGTCG AG TTTC AC T A AGTACTAAAC CCACGCAATT ATTCTCAAAA 4 440 
AAAAAACCCA CGCAACTGTC TGGATCCATC TTCGTTTTTT CCCCGAGAAT CGTCTGGATC 4 500 
CATTTTCGTG TGCGAGGCAT CCTCTCATTT TGCACGGCCC AGCTCTCTTC TCGCCGGCGT 4 560 

55 

ACGCTGCTAC ATGTCGGCAC TCCACGCAAA CAAAAAGAAG CCCAACCGAA AACGCACGCG 4 62 0 
CCTTTCCAGG CTCACCACGG AAAAAAATAC CACGCGCCGC TCACGAGCAA ACCGTGACAA 4 680 
60 CAGCCAGCCA GATATGGCAA CGGAGGCACG GGCCGCACAC AGCCACTGAA AACCGCAGCT 4 74 0 
GCTCTTCCGT CCGTCCGTCC CTCCGCCCGT CCGCGCCACT CCACTCGCCT TGCCCCACTC 4 800 
CCACTCTTCT CTCCCCGCGC ACACCGAGTC GGCACCGGCT CATCACCCAT CACCTCGGCC 4 860 
TCGGCCACCG GCAAACCCCC CGATCCGCTT TTGCAGGCAG CGCACTAAAA CCCCGGGGAG 4 920 



65 
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CGCGCCCCGC GGCAGCAGCA GCACCGCAGT GGG AG AG AG A GGCTTCGCCC CGGCCCGCAC 4 98 0 
CGAGCGGGGC GATCCACCGT CCGTGCGTCC GCACCTCCTC CGCCTCCTCC CCTGTCCCGC 504 0 
5 GCGCCCACAC CCATGGCGGC GACGGGCGTC GG 5072 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1706 base pairs 
1 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: trilicum tauschii 
20 (F) TISSUE TYPE: Endosperm 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1„ 1706 

2 5 (D) OTHER INFORMATION:/product= "partial cDNA for 
hexaploid wheat DBE" 



30 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GCT GTG TCG AAG CTT GAC TAT TTG AAG GAG CTT GGA GTT AAT TGT ATT 
Ala Val Ser Lys Leu Asp Tyr Leu Lys Glu Leu Gly Val Asn Cys He 
15 10 15 



40 L 3 5 Gly Thr Ile ASn Phe Phe Ser 



48 



GAA TTA ATG CCC TGC CAT GAG TTC AAC GAG CTG GAG TAC TCA ACC TCT 9 6 

Jb Glu Leu Met Pro Cys His Glu Phe Asn Glu Leu Glu Tyr Ser Thr Ser 

20 25 30 



TCC AAG ATG AAC TTT TGG GGA TAT TCT ACC ATA AAC TTC TTT TCA 144 

-ly Tyr Ser Thr Ile Asn 
40 45 



CCA ATG ACG AG A TAC AC A TCA GGC GGG ATA AAA AAC TGT GGG CGT GAT 192 
Pro Met Thr Arg Tyr Thr Ser Gly Gly Ile Lys Asn Cys Gly Arg Asp 
50 55 60 

GCC ATA AAT GAG TTC AAA ACT TTT GTA AGA GAG GCT CAC AAA CGG GGA 240 
Ala Ile Asn Glu Phe Lys Thr Phe Val Arg Glu Ala His Lys Arg Gly 
65 70 75 80 

ATT GAG GTG ATC CTG GAT GTT GTC TTC AAC CAT ACA GCT GAG GGT AAT 288 
Ile Glu Val Ile Leu Asp Val Val Phe Asn His Thr Ala Glu Gly Asn 
85 90 95 

GAG AAT GGT CCA ATA TTA TCA TTT AGG GGG GTC GAT AAT ACT ACA TAC 336 
bb Glu Asn Gly Pro lie Leu Ser Phe Arg Gly Val Asp Asn Thr Thr Tyr 
iOO 105 no 

TAT ATG CTT GCA CCC AAG GGA GAG TTT TAT AAC TAT TCT GGC TGT GGG 3 84 
Tyr Met Leu Ala Pro Lys Gly Glu Phe Tyr Asn Tyr Ser Gly Cys Gly 
bU 115 120 125 
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AAT ACC 
Asn Thr 
130 

5 TGT TTA 
Cys Leu 
145 

GAT CTT 
10 Asp Leu 



AAC GTG 
Asn Val 

15 

CCT CTT 
Pro Leu 

20 

CTT GGA 
Leu Gly 
210 

25 TAT CAA 
Tyr Gin 
225 

GGG AAG 
30 Gly Lys 



TTT GCT 
Phe Ala 

35 

CAG GCA 
Gin Ala 

40 

CAT GAT 
His Asp 
290 

45 AAT TTA 
Asn Leu 
305 

AGC TGG 
50 Ser Trp 



AGA TTG 
Arg Leu 

55 

TCT CAA 
Ser Gin 

60 

AAA GGG 
Lys Gly 
370 



TTC AAC TGT 
Phe Asn Cys 



AGA TAC TGG 
Arg Tyr Trp 



GCA TCC ATA 
Ala Ser He 
165 

TAT GGA GCT 
Tyr Gly Ala 
180 

GTT ACT CCA 
Val Thr Pro 
195 

GGC GTC AAG 
Gly Val Lys 



GTA GGT CAA 
Val Gly Gin 



TAC CGG GAC 
Tyr Arg Asp 
245 

GGT GGT TTT 
Gly Gly Phe 
260 

GGA GGA AGG 
Gly Gly Arg 
275 

GGA TTT ACA 
Gly Phe Thr 



CCA AAT GGG 
Pro Asn Gly 



AAT TGT GGG 
Asn Cys Gly 
325 

AGG AAG AGG 
Arg Lys Arg 
340 

GGA GTT CCA 
Gly Val Pro 
355 

GGC AAC AAC 
Gly Asn Asn 



AAT CAT CCT 
Asn His Pro 
135 

GTG ATG GAA 
Val Met Glu 
150 

ATG ACC AGA 
Met Thr Arg 



CCA ATA GAA 
Pro He Glu 



CCA CTT ATT 
Pro Leu He 
200 

CTC ATT GCT 
Leu He Ala 
215 

TTC CCT CAC 
Phe Pro His 
230 

ATT GTG CGC 
He Val Arg 



GCC GAA TGT 
Ala Glu Cys 



AAA CCT TGG 
Lys Pro Trp 
280 

CTG GGT GAT 
Leu Gly Asp 
295 

GAG AAC AAT 

Glu Asn Asn 
310 

GAG GAA GGA 

Glu Glu Gly 



CAG ATG CGC 
Gin Met Arg 



ATG TTT TAC 
Met Phe Tyr 
360 

AAT ACA TAC 
Asn Thr Tyr 
375 



101 - 

GTG GTT CGT 
Val Val Arg 



ATG CAT GTT 
Met His Val 
155 

GGT TCC AGT 
Gly Ser Ser 
170 

GGT GAC ATG 
Gly Asp Met 
185 

GAC ATG ATC 
Asp Met He 



GAA GCA TGG 
Glu Ala Trp 



TGG AAT GTT 
Trp Asn Val 
235 

CAA TTC ATT 
Gin Phe He 
250 

CTT TGT GGA 
Leu Cys Gly 
265 

CAC AGT ATC 
His Ser He 



TTG GTA ACA 
Leu Val Thr 



AGA GAT GGA 
Arg Asp Gly 
315 

GAA TTC GCA 
Glu Phe Ala 
330 

AAT TTC TTT 
Asn Phe Phe 
345 

ATG GGC GAT 
Met Gly Asp 



TGC CAT GAT 
Cys His Asp 



CAA TTC ATT 

Gin Phe He 
140 

GAT GGT TTT 

Asp Gly Phe 



CTG TGG GAT 
Leu Trp Asp 



ATC ACA ACA 
He Thr Thr 
190 

AGC AAT GAC 
Ser Asn Asp 
205 

GAT GCA GGA 
Asp Ala Gly 
220 

TGG TCT GAG 
Trp Ser Glu 



AAA GGC ACT 
Lys Gly Thr 



AGT CCA CAC 
Ser Pro His 
270 

AAC TTT GTA 
Asn Phe Val 
285 

TAT AAT AAC 
Tyr Asn Asn 
300 

GAA AAT CAC 
Glu Asn His 



AGA TTG TCT 
Arg Leu Ser 



GTT TGT CTC 
Val Cys Leu 
350 

GAA TAT GGC 
Glu Tyr Gly 
365 

TCT TAT GTC 
Ser Tyr Val 
380 



GTA GAT 43 2 
Val Asp 



CGT TTT 4 80 
Arg Phe 
160 

CCA GTT 52 8 

Pro Val 

175 

GGG ACA 57 6 
Gly Thr 



CCA ATT 624 
Pro He 



GGC CTC 67 2 
Gly Leu 



TGG AAT 720 
Trp Asn 
240 

GAT GGA 76 8 

Asp Gly 

255 

CTA TAC 816 
Leu Tyr 



TGT GCA 864 
Cys Ala 



AAG TAC 912 
Lys Tyr 

AAT CTT 960 
Asn Leu 
320 

GTC AAA 100 8 

Val Lys 

335 

ATG GTT 1056 
Met Val 



CAC ACA 1104 
His Thr 



AAT TAT 1152 
Asn Tyr 
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Aw * Z GG GAT AAA AAA GAA CAA TAC TCT GAC TTG CAC AGA TTC TGC 1200 
Phe Arg Trp Asp Lys Lys Glu Gin Tyr Ser Aso Leu His Arg Phe Cys 
385 390 395 400 

TGC CTC ATG ACC AAA TTC CGC AAG GAG TGC GAG GGT CTT GGC CTT GAG 124 8 
Cys Leu Met Thr Lys Phe Arg Lys Glu Cys Glu Gly Leu Gly lIu Su 
4 °5 410 415 

G AC TTT CCA ACG GCC GAA CGG CTG CAG TGG CAT GGT CAT CAG CCT GGG 12 9 6 
Asp Phe Pro Thr Ala Glu Arg Leu Gin Trp His Gly His Gin Pro Gly 
420 425 43Q 

1 C ^ ? AT ^ TCT GAG AGC CGA TTC GTT GCC TTT TCC ATG AAA 1344 

15 Lys Pro Asp Trp Ser Glu Asn Ser Arg Phe Val Ala Phe Ser Met Lys 
435 440 445 

GAT GAA AGA CAG GGC GAG ATC TAT GTG GCC TTC AAC ACC AGC CAC TTA 13 92 

90 P G l* ° ln Gly G1U Ile ^ Val Ala Phe Asn Thr Ser His Leu 

ZU 450 45S 460 

CCG GCC GTT GTT GAG CTC CCA GAG CGC GCA GGG CGC CGG TGG GAA CCG 
Pro Ala Val Val Glu Leu Pro Glu Arg Ala Gly Arg Arg Trp Glu Pro 
25 465 470 4 ?5 480 

GTG GTG GAC ACA GGC AAG CCA GCA CCA TAT GAC TTC CTC ACC GAC GAC 1488 
Val Val Asp Thr Gly Lys Pro Ala Pro Tyr Asp Phe Leu Thr Asp- Asp 
4 85 490 495 

TTA CCT GAT CGC GCT CTC ACC ATA CAC CAG TTC TCT CAT TTC CTC AAC 153 6 
Leu Pro Asp Arg Ala Leu Thr Ile His Gin Phe Ser His Phe Leu Asn 
5 °0 505 5io 

TCC AAC CTC TAC CCC ATG CTC AGC TAC TCA TCG GTC ATC CTA GTA TTG 1584 
Ser Asn Leu Tyr Pro Met Leu Ser Tyr Ser Ser Val Ile Leu Val Leu 
515 520 525 

CGC CCT GAT GTT TGA GAG ACA AAT ATA TAC AGT AAA TAA TAT GTC TAT 163 2 

Lys 
540 



30 



35 



45 



1440 



An ^ ASP Val * Glu Thr Asn Ile ^ Ser Lys *. Tyr Val Tyr 

530 535 J 



ATG TAG TCC TTT GGC GTA TTA TCA GTG TGC ACA ATT GCT CTA TTG CCA 1680 
Met * Ser Phe Gly Val Leu Ser Val Cys Thr Ile Ala Leu Leu Pro 
545 550 555 560 

GTG ATC TAT TCG ATA GCG GCC GCG AA 1706 
Val Ile Tyr Ser Ile Ala Ala Ala 
565 



5 0 (2) INFORMATION FOR SEQ ID NO: 1 7- 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 9289 base pairs 

(B) TYPE: nucleic acid 

(C) STR ANDEDNESS : single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

60 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: triticum tauschii 
(F) TISSUE TYPE: Endosperm 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: I ..9289 

(D) OTHER INFORMATION :/product= "genomic sequence of DBE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGG GAC CGT CCC TTG GCA ACT TGG GTT ACG TTG GGA CCT GAC GCT TCG 4 8 
Arg Asp Arg Pro Leu Ala Thr Trp Val Thr Leu Gly Pro Asp Ala Ser 
570 575 580 

15 CTT ATC CGG TGT GCC CTG AGA CGA GAT ATG TGC AGC TCC TAT CGG ATT 96 
Leu He Arg Cys Ala Leu Arg Arg Asp Met Cys Ser Ser Tyr Arg lie 
585 590 595 600 

TGT CGG CAC ATT CGG CGG CTT TGC TGG TCT TGT TTT ACC ATT GTC GAA 144 

2 0 Cys Arg His lie Arg Arg Leu Cys Trp Ser Cys Phe Thr lie Val Glu 

605 610 615 

ATG TCT TAT AAA CCG GGA TTC CGA GAC TGA TCG GGT CTT CCC GGG AGA 192 
Met Ser Tyr Lys Pro Gly Phe Arg Asp * Ser Gly Leu Pro Gly Arg 
25 620 625 630 

AGG TTT ATC CTT CGT TGA CCG TGA GAG CTT ATA ATG GGC TAA GTT GGG 240 

Arg Phe lie Leu Arg * Pro * Glu Leu lie Met Gly * Val Gly 

635 640 645 

30 

ACA CCC CTG CAG GGT ATT ATC TTT CGA AAG CCG TGC CCG CGG TTA TGA 2 88 

Thr Pro Leu Gin Gly lie lie Phe Arg Lys Pro Cys Pro Arg Leu * 
650 655 660 

3 5 GGC AGA TGG GAA TTT GTT AAT GTC CGA TTG TAG AGA ACC TGT CAC TTG 33 6 

Gly Arg Trp Glu Phe Val Asn Val Arg Leu * Arg Thr Cys His Leu 
665 670 675 680 

ACT TAA TTT AAA ATT CAT CAA CCG TGT GTG TAG CCG TGA TGG TCT CTT 384 

4 0 Thr * Phe Lys lie His Gin Pro Cys Val * Pro * Trp Ser Leu 

685 690 695 

TTC GGC GGA GTC CGG GAA GTG AAC ACG GTT TGA GTT ATG CAT GAA CGT 432 
Phe Gly Gly Val Arg Glu Val Asn Thr Val * Val Met His Glu Arg 
45 700 705 710 

AAG TAG TTT CAG GAT CAC TCC TTG ATC ACT TCT AGC TCC GCG ACC GTT 48 0 

Lys * Phe Gin Asp His Ser Leu lie Thr Ser Ser Ser Ala Thr Val 

715 720 725 

50 

GCG TTG TTT CTC TTC TCG CTC TCA TTT GCG TAT GTT AGC CAC CAT ATA 52 8 

Ala Leu Phe Leu Phe Ser -Leu Ser Phe Ala Tyr Val Ser His His lie 

730 735 740 

55 TGC TTA GTG TCT GCT GCA GCT CCA CCT CAT TAC CCC TTC CTT TCC TAT 57 6 
Cys Leu Val Ser Ala Ala Ala Pro Pro His Tyr Pro Phe Leu Ser Tyr 
745 750 755 760 

AAG CTT AAA TAG TCT TGA TCT CGC GGG TGT GAG ATT GCT GAG TCC TCG 62 4 
60 Lys Leu Lys * Ser * Ser Arg Gly Cys Glu lie Ala Glu Ser Ser 

765 770 775 
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TGA CTT ACA GAT TCT ACC AAA ACA GTT GCA GGT GTC GAC GAT GCC AGT 672 
* Leu Thr Asp Ser Thr Lys Thr Val Ala Gly Val Asp Asp Ala Ser 
7 8° 785 790 

GCA GGT GAC GCA ACC GAG CTC AAG TGG GAG TTC GAC GAG GAA CGT GGT 720 
Ala Gly Asp Ala Thr Glu Leu Lys Trp Glu Phe Asp Glu Glu Arg Gly 
795 800 805 

CGT TAC TAT GTT TCT TTT CCT GAT GAT CAG TAG TGG AGC CCA GTT GGG 7 68 
Arg Tyr Tyr Val Ser Phe Pro Asp Asp Gin * Trp Ser Pro Val Gly 
810 815 820 

ACG ATC GGG GAT CTA GCA TTT GGG GTT ATC TTA ATT TCT TTT AGA TTT 816 

i c 1-5 116 Gly Asp Leu Ala Phe G1 y Val Ile Leu Ile Ser Phe Arg Phe 
15 825 830 • 835 840 

GAC CGT AAT CGG TCT ATG TGT GGA TTT TGG ATG ATG TAT GAA TTA TTT 864 
Asp Arg Asn Arg Ser Met Cys Gly Phe Trp Met Met Tyr Glu Leu Phe 

20 845 850 855 

ATG TAT TGT GTG AAG TGG CGA TTG TAA GCC AAC TCT CGT TAT CCC ATT 912 
Met Tyr Cys Val Lys Trp Arg Leu * Ala Asn Ser Arg Tyr Pro Ile 
860 865 870 

25 CTT GTT CAT TAC ATG GGA TTG TGT GAA GAT GAC CCT TCT TGC GAC AAA 
Leu Val His Tyr Met Gly Leu Cys Glu Asp Asp Pro Ser Cys Asp Lys 
875 880 885 

ACC ACA ATG CGG TTA TGC CTC TAA GTC GTG CCT CGA CAC GTG GGA GAT 
JU Thr Thr Met Arg Leu Cys Leu * Val Val Pro Arg His Val Gly Asp 
890 895 900 



50 



— — -.->- *^-v. nuu .rvrvvj «v_v_ 

„ noc Trp Ser Hls Ala Pr o He Ala Glu Ile Ser Glu Arg Lys Thr Gly 

D _> 985 990 



60 



960 



1008 



ATA GCC GCA TCG TGG GCG TTA CAC GCA AGT CTT CAT AGC AAC CAA AAC 1056 

lit Ala Ala Ser Trp Ala Leu His Ala Ser Leu His Ser Asn Gin Asn 
35 905 910 915 920 

TCC TCT CCG CAT TAC AAG CCA CCA ATC GCA GCC ACC ATG ACT TTC TTC 1104 
Ser Ser Pro His Tyr Lys Pro Pro Ile Ala Ala Thr Met Thr Phe Phe 
40 925 930 "5 

ACC ACT GTC AAT GCC ATG AAA ATC TAT ATG TAG ACA TGT CCC ATT GCA 1152 
Thr Thr Val Asn Ala Met Lys Ile Tyr Met * Thr Cys Pro Ile Ala 
940 945 950 

45 TCG GCA AGA AAG CGA AGC TTC ACG GCA CAC CTT CAT GAA GCC TCT CTG 
Ser Ala Arg Lys Arg Ser Phe Thr Ala His Leu His Glu Ala Ser Leu 
955 960 965 



1200 



GCC GAA GAC AAG GAT GCG CCC GAC CGG ATC AAT TCC TAT CTA GAT ACC 124 8 
Ala Glu Asp Lys Asp Ala Pro Asp Arg Ile Asn Ser Tyr Leu Asp Thr 
970 975 980 



™ G ^ AT ?? G ? CA ATA GCG GAG ATC TCC GA G AGG AAG ACC GGA 12 9 6 

™ " " " * " " " Ser Glu Arg Lys Thr Gly 

"5 1000 



ACT CGT CGG ACG TCG GCG TCC AAA TCG AGG AGG CCG GCA TGA AGC ACA 134 4 
Thr Arg Arg Thr Ser Ala Ser Lys Ser Arg Arg Pro Ala * Ser Thr 
1Q 05 1010 1015 

TCG AGG ATG GTG ATC CCC ATA CGG GTA GAT CGG GTC GGC CGC CAT CTC 13 9 2 
Ser Arg Met Val lie Pro Ile Arg Val Asp Arg Val Gly Arg His Leu 
1020 1025 1030 



WO 99/14314 



PCT/AU98/00743 



- 105 - 

AC A CCG AGA TTA GGA TGC TTA AAA CGG TTT TTT TGG CAC TAG CAT TAT 144 0 

Thr Pro Arg Leu Gly Cys Leu Lys Arg Phe Phe Trp His * His Tyr 
1035 1040 1045 

5 TTT GCA TCA TCC GTT GGA GAG AAC ATG AGA GAG CCC CAT TTC TTC CAC 14 88 
Phe Ala Ser Ser Val Gly Glu Asn Met Arg Glu Pro His Phe Phe His 
1050 1055 1060 

GGT TCT ACC TAT GGG ATC TTG TTC TGC TTG CAA CCG GGC CTC ACG GAA 153 6 
10 Gly Ser Thr Tyr Gly lie Leu Phe Cys Leu Gin Pro Gly Leu Thr Glu 
1065 1070 1075 1080 

AAC CCG CGC CAG CGG ACC CAC CCC ATG CTA GCA GGG CAC GGC ACC CGC 15 84 
Asn Pro Arg Gin Arg Thr His Pro Met Leu Ala Gly His Gly Thr Arg 
15 1085 1090 1095 

AGC GGC CGG TCC AAA TGG ACG GTG AGA ACC GCA ACG CGA CAC GCC CGG 163 2 

Ser Gly Arg Ser Lys Trp Thr Val Arg Thr Ala Thr Arg His Ala Arg 
1100 1105 1110 

20 

CAC TGT CAG CAA AGC GAG AGC GCG CGC ACG GCA CAC GCA CGC TCG GAC 1680 

His Cys Gin Gin Ser Glu Ser Ala Arg Thr Ala His Ala Arg Ser Asp 
1115 1120 1125 

2 5 GAA CGG ACG GTG CGA TCG ATC CCT CCC CCC TCG CTC AAC CAC AGT AGT 17 2 8 
Glu Arg Thr Val Arg Ser lie Pro Pro Pro Ser Leu Asn His Ser Ser 
1130 1135 1140 

ACC CTG CCA CAC TAT CAC GCA CGC ACT CGA GTC ACA CCT CCC ACG AAG 177 6 
% J1 3 0 Thr Leu Pro His Tyr His Ala Arg Thr Arg Val Thr Pro Pro Thr Lys 

1145 1150 1155 1160 

Q AAC CAA CAG GAG GCG CGG ATC CCA CCG ATA AAT AAC CCC GCC TCG CCG 182 4 

m Asn Gin Gin Glu Ala Arg lie Pro Pro lie Asn Asn Pro Ala Ser Pro 

q 35 1165 1170 1175 

j=f CTC CTC CCC AAA ATC AAT CAC CGA TCG CTC GGG GTT CCC GGC ATG ACG 1872 

y Leu Leu Pro Lys lie Asn His Arg Ser Leu Gly Val Pro Gly Met Thr 

Q 1180 1185 1190 

40 

ATG ATG GCC ATG GCC AAG GCG CCC TGC CTC TGC GCG CGC CCG TCC CTC 192 0 

Met Met Ala Met Ala Lys Ala Pro Cys Leu Cys Ala Arg Pro Ser Leu 
1195 1200 1205 

4 5 GCC GCG CGC GCG AGG CGG CCG GGG CCG GGG CCG GCG CCG CGC CTG CGA 19 6 8 

Ala Ala Arg Ala Arg Arg Pro Gly Pro Gly Pro Ala Pro Arg Leu Arg 
1210 1215 1220 

CGG TGG CGA CCC AAT GCG ACG GCG GGG AAG GGG GTC GGC GAG GTG TGC 2 016 

5 0 Arg Trp Arg Pro Asn Ala Thr Ala Gly Lys Gly Val Gly Glu Val Cys 

1225 1230 1235 1240 

GCC GCG GTT GTC GAG GCG GCG ACG AAG GCC GAG GAT GAG GAC GAC GAC 206 4 
Ala Ala Val Val Glu Ala Ala Thr Lys Ala Glu Asp Glu Asp Asp Asp 
55 1245 1250 1255 

GAG GAG GAG GCG GTG GCG GAG GAC AGG TAC GCG CTC GGC GGC GCG TGC 2112 
Glu Glu Glu Ala Val Ala Glu Asp Arg Tyr Ala Leu Gly Gly Ala Cys 
1260 1265 1270 



60 



AGG GTG CTC GCC GGA ATG CCC GCG CCG CTG GGC GCC ACC GCG CTC GCC 2160 
Arg Val Leu Ala Gly Met Pro Ala Pro Leu Gly Ala Thr Ala Leu Ala 
1275 1280 1285 
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GGC GGG GTC AAT TTC GCC GTC TAC TCC GGT GGA GCC ACC GCC GCG GCG 2 2 08 

Gly ?in~ Val ASn Phe Ala Val Tyr Ser G1 V G1 y Ala Thr Ala Ala Ala 
1290 1295 1300 

5 CTC TGC CTC TTC ACG CCA GAA GAT CTC AAG GCG GTG GGG TTG CCT CCC 2 2 56 
Leu Cys Leu Phe Thr Pro Glu Asp Leu Lys Ala Val Gly Leu Pro Pro 
1305 1310 1315 1320 

GAG TAG AGT TCA TCA GCT TTG CGT GCG CCG CGC GCC CCC TTT TCT GGC 23 04 
1U Glu * Ser Ser Ser Ala Leu Arg Ala Pro Arg Ala Pro Phe Ser Gly 

!325 1330 1335 

CTG CGA TTT AAG TTT TGT ACT GGG GGA AAT GCT GCA GGA TAG GGT GAC 23 52 
Leu Arg Phe Lys Phe Cys Thr Gly Gly Asn Ala Ala Gly - Gly Asp 
15 1340 1345 y 1350 

GGA GGA GGT TTC CCT TGA CCC CCT GAT GAA TCG GAC TGG GAA CGT GTG 2400 
- Gly Gly Gly Phe Pro * Pro Pro Asp Glu Ser Asd Trp Glu Arg Val 
2Q 1355 1360 * 1365 

GCA TGT CTT CAT TGA AGG CGA GCT GCA CGA CAT GCT TTA CGG GTA CAG 2448 
Ala Cys Leu His * Arg Arg Ala Ala Arg His Ala Leu Arg Val Gin 
!370 1375 1380 

25 GTT CGA CGG CAC CTT TGC TCC TCA CTG CGG GCA CTA CCT TGA TAT TTC 2496 
Val Arg Arg His Leu Cys Ser Ser Leu Arg Ala Leu Pro * Tyr Phe 
1385 1390 1395 1400 

CAA TGT CGT GGT GGA TCC TTA TGC TAA GGT GAT CAT ACT TTA GCT TTA 2544 
Gin Cys Arg Gly Gly Ser Leu Cys * Gly Asp His Thr Leu Ala Leu 
1405 1410 1415 

CCT GCA TCT TGG TAT TTA CAG TAG AAA TTG TTA CGT GGA CCC TTA TTT 2592 
Pro Ala Ser Trp Tyr Leu Gin * Lys Leu Leu Arg Gly Pro Leu Phe 
35 1420 1425 1430 

GTT GCC TTT TGT GTT GCT CTA GGC AGT GAT AAG CCG AGG GGA GTA TGG 2 640 
Val Ala Phe Cys Val Ala Leu Gly Ser Asp Lys Pro Arg Gly Val Trp 
4Q 1435 1440 1445 

CGT TCC GGC GCG TGG TAA CAA TTG CTG GCC TCA GAT GGC TGG CAT GAT 2 688 
. Arg Ser Gly Ala Trp • Gin Leu Leu Ala Ser Asd Gly Trp His Asp 
1450 1455 1460 

45 CCC TCT TCC ATA TAG CAC GGT ATG CCT GAT TGC TGA AAA TAT TGG CTG 273 6 
Pro Ser Ser lie * His Gly Met Pro Asp Cys - Lys Tyr Trp Leu 
1465 1470 1475 1480 



30 



50 



CAT TTG TTT CTC TCT TTT TCT CAT ATT TTT CTC CTG TCT TTC ACT TGT 2 7 84 
His Leu Phe Leu Ser Phe Ser His lie Phe Leu Leu Ser Phe Thr Cys 
1485 1490 1495 

ACT AC A TTG CCT CAG ACA GTC ATG ATC AAA GAG AGC AGT GTC ATT AGA 2 832 
Thr Thr Leu Pro Gin Thr Val Met He Lys Glu Ser Ser Val He Arg 
55 1500 150 5 1510 

CAT TTG TAG TTG TCT GCT GAC TTT GAC CAA AAC TTG TAA TTT ACT GTT 2 880 
His Leu * Leu Ser Ala Asp Phe Asp Gin Asn Leu * Phe Thr Val 
6Q 1515 1520 1525 

GTT AAA GGT CCT TGA ATC ATA TTT TTT TAT AAT ATT ATG TTT GCA AGT 2 92 8 
Val Lys Gly Pro * He He Phe Phe Tyr Asn He Met Phe Ala Ser 
1530 1535 i 540 
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GGA AGT AAA GTG AAA TTG CAT CTA GTA TTT GTT GTT GCT GTC TTA GTC 29 7 6 
Gly Ser Lys Val Lys Leu His Leu Val Phe Val Val Ala Val Leu Val 
1545 1550 1555 1560 

5 GTT TAA TTG GAC ATG CAG TAA AAA GGT TTG CAT CTG CAG TTT GAT TGG 302 4 
Val * Leu Asp Met Gin * Lys Gly Leu His Leu Gin Phe Asp Trp 
1565 1570 1575 

GAA GGC GAC CTA CCT CTA AGA TAT CCT CAA AAG GAC CTG GTA ATA TAT 307 2 
10 Glu Gly Asp Leu Pro Leu Arg Tyr Pro Gin Lys Asp Leu Val lie Tyr 

1580 1585 1590 

GAG ATG CAC TTG CGT GGA TTC ACG AAG CAT GAT TCA AGC AAT GTA GAA 312 0 
Glu Met His Leu Arg Gly Phe Thr Lys His Asp Ser Ser Asn Val Glu 
15 1595 1600 1605 

CAT CCG GGT ACT TTC ATT GGA GCT GTG TCG AAG CTT GAC TAT TTG AAG 3168 

His Pro Gly Thr Phe lie Gly Ala Val Ser Lys Leu Asp Tyr Leu Lys 
1610 1615 1620 

20 

GTA CAG CTG TAC TTG CTG ACT AC A TAG GAT AAT TTT TAA AGA AAG CTA 3216 

ja - Val Gin Leu Tyr Leu Leu Thr Thr * Asp Asn Phe * Arg Lys Leu 

^ 1625 1630 1635 1640 

IJi 2 5 CAT ATT AGC CAG AAT TTG GGT TAT TAC AAA AAC TAC TGC ATA CTA TAG 32 6 4 

Q His lie Ser Gin Asn Leu Gly Tyr Tyr Lys Asn Tyr Cys lie Leu * 

|g 1645 1650 1655 

!f! CAG TTA CAT GCT CAT TAT CGA GGA GAT GCT CAC ACG CAT CTT ATT TGG 3312 

3 0 Gin Leu His Ala His Tyr Arg Gly Asp Ala His Thr His Leu lie Trp 
'%! 1660 1665 1670 

Q ATT TAA TAC CCA ATT CTG TTT TGA TAT TGG ACT GTT CCC TCT ACA GGA 33 60 

He * Tyr Pro He Leu Phe * Tyr Trp Thr Val Pro Ser Thr Gly 
!l 3 35 1675 1680 1685 

s — : 

GCT TGG AGT TAA TTG TAT TGA ATT AAT GCC CTG CCA TGA GTT CAA CGA 340 8 
O Ala Trp Ser * Leu Tyr * He Asn Ala Leu Pro * Val Gin Arg 

i=l 1690 1695 1700 

40 

GCT GGA GTA CTC AAC CTC TTC TTC CAA GTA AGG ACA TGA ATT TAG TAT 3456 
Ala Gly Val Leu Asn Leu Phe Phe Gin Val Arg Thr * He * Tyr 
1705 1710 1715 1720 

45 TAG CCT GCC AGC ACT GTT TGA GTG AGA GTT CAT ACA CAT TTT GTG CCT 3 504 
* Pro Ala Ser Thr Val * Val Arg Val His Thr His Phe Val Pro 
1725 1730 1735 

GCA TAA CTG ATA TTT GTT CAA ACT ATT TTT TTT AGC AGT CAC TCA ACA 3 552 
5 0 Ala * Leu He Phe Val Gin Thr He Phe Phe Ser Ser His Ser Thr 

1740 1745 1750 

GTT TTA CAT ATA TAT ATA ATA TAG ACT ATT CGT CAC CCT GGG TGA GGA 3 6 00 
Val Leu His He Tyr He He * Thr He Arg His Pro Gly * Gly 
55 1755 1760 . 1765 

ATA GTT ATT CTT CAC CCA CCT CTA TTT TAA CAT CTA TGC ACC GTA ATT 3 64 8 
He Val He Leu His Pro Pro Leu Phe * His Leu Cys Thr Val He 
1770 1775 1780 



60 



TTA CGT TTC GTA AAT TTG TCT TAT TTT AGA GAT AAA AAG AGA ACG TAA 3 696 
Leu Arg Phe Val Asn Leu Ser Tyr Phe Arg Asp Lys Lys Arg Thr * 
1785 1790 1795 1800 
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GAA AAC CTA TAA TCG TCG TAA AAA AAA ATA TGT TAC GTA AAA TTA CAA 3744 
Glu Asn Leu * Ser Ser * Lys Lys lie Cys Tyr Val Lys Leu Gin 
1805 1810 1815 

5 ATG TAA AAA CAT AGT GTA AAA TGT ACA TAA AAT AC A TTT TTT GAC CTA 3 7 92 
Met * Lys His Ser Val Lys Cys Thr * Asn Thr Phe Phe Asd Leu 
1820 1825 1830 

TAT TTT TTT TGT TAA TGC CAA ATT TTA TAC AGT AAA TCA ATA TGA ATG 3 840 
10 Tyr Phe Phe Cys * Cys Gin He Leu Tyr Ser Lys Ser He * Met 
1835 1840 1845 

TAA CTA TTT GTA TTT CAA ATG TAA TTT ATT TAT GAA ATG GTC GTA AG A 3 88 8 

* Leu Phe Val Phe Gin Met * Phe He Tyr Glu Met Val Val Arg 
15 1850 1855 1860 

TTA CCT CGG GTG AAG AAT AAC TTA TTC TGC ACC CTG GGT GAT GAA TAG 3 93 6 
Leu Pro Arg Val Lys Asn Asn Leu Phe Cys Thr Leu Gly Asp Glu * 
1865 1870 1875 1880 

20 

TAA CAC TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA CCG GCT 3 9 84 

* His Tyr He Tyr He Tyr He Tyr He Tyr He Tyr He Pro Ala 
1885 1890 1895 

2 5 GCT GCT AAT GAT GTT AAT ATT TCG CAA GTA CCT AAG CTG GAT TTT TCT 4 03 2 
Ala Ala Asn Asp Val Asn He Ser Gin Val Pro Lys Leu Asp Phe Ser 
O 1900 1905 1910 

iVl CCA TGA GAC ATC CCA TAA TTG AAA TTG GTC ACG ACA GTT GAA TAG 4080 

?*! 3 0 * Asp He Asn Pro * Leu Lys Leu Val Thr Thr Val Glu * 

y % 1915 1920 1925 

TTG ATA GCT GAA AAT GAA ATC CAG CAT GCT ACT GTC TTG CCA TCT CCA 412 8 
Q Leu H e A la Glu Asn Glu He Gin His Ala Thr Val Leu Pro Ser Pro 

;~ 35 1930 1935 1940 



sjl 



40 



GAC TTG CTA ACA TGA ATT TTG TCT GCC TAC CTG TCA TTT GTA CCA ACG 417 6 
Asp Leu Leu Thr * He Leu Ser Ala Tyr Leu Ser Phe Val Pro Thr 
1945 1950 1955 I960 

TTC CCA ATT GCC CTC TCA TTA TTC GTG TGT ACC ATG CAT ATG TGT TTT 4 224 
Phe Pro He Ala Leu Ser Leu Phe Val Cys Thr Met His Met Cys Phe 
1965 1970 1975 

4 5 AAC ATG ATT ATT GTT GGC TAT ATT TCT CTT TGG AAA CAT GAC TAA TTT 4272 
Asn Met He He Val Gly Tyr He Ser Leu Trp Lys His Asp * Phe 
1980 1985 1990 

ATC ACC CGT TTT GTA TAA ACT GCT TGT TTT CAT ATC AGG ATG AAC TTT 43 2 0 
50 lie Thr Arg Phe Val * Thr Ala Cys Phe His He Arg Met Asn Phe 
1995 2000 2005 

TGG GGA TAT TCT ACC ATA AAC TTC TTT TCA CCA ATG ACG AGA TAC ACA 43 6 8 
Trp Gly Tyr Ser Thr He Asn Phe Phe Ser Pro Met Thr Arg Tyr Thr 
55 2010 2015 2020 



60 



TCA GGC GGG ATA AAA AAC TGT GGG CGT GAT GCC ATA AAT GAG TTC AAA 4 416 

Ser Gly Gly He Lys Asn Cys Gly Arg Asp Ala He Asn Glu Phe Lys 
2025 2030 2035 2040 

ACT TTT GTA AGA GAG GCT CAC AAA CGG GGA ATT GAG GTA AGC AAG TCG 4 4 64 

Thr Phe Val Arg Glu Ala His Lys Arg Gly He Glu Val Ser Lys Ser 
2045 2050 2055 
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TAC GAG TTA GTT GCT CCT TTT GAA CTT ATC AAT TTG ATG CGA AG A CAT 4 512 

Tyr Glu Leu Val Ala Pro Phe Glu Leu lie Asn Leu Met Arg Arg His 
2060 2065 2070 

5 GTT ACT GCT AGG TGA TCC TGG ATG TTG TCT TCA ACC ATA CAG CTG AGG 4 560 
Val Thr Ala Arg * Ser Trp Met Leu Ser Ser Thr lie Gin Leu Arg 
2075 2080 2085 

GTA ATG AG A ATG GTC CAA TAT TAT CAT TTA GGG GGG TCG ATA ATA CTA 4 608 
10 Val Met Arg Met Val Gin Tyr Tyr His Leu Gly Gly Ser lie lie Leu 
2090 2095 2100 

CAT ACT ATA TGC TTG CAC CCA AGG TGA CAG ATC TTT CTT GCT GCG TAA 4656 
His Thr He Cys Leu His Pro Arg * Gin He Phe Leu Ala Ala * 
15 2105 2110 2115 2120 



20 



TTG TTC TTT CAT AGA TGT ATA GAG CAT AGA TGT GTT ATG TAG TAG TTC 4704 
Leu Phe Phe His Arg Cys He Glu His Arg Cys Val Met * * Phe 
2125 2130 2135 

TTT TTC AAG GGG ATT ATG TTC ATG CAG GGA GAG TTT TAT AAC TAT TCT 47 52 
Phe Phe Lys Gly He Met Phe Met Gin Gly Glu Phe Tyr Asn Tyr Ser 
2140 2145 2150 

2 5 GGC TGT GGG AAT ACC TTC AAC TGT AAT CAT CCT GTG GTT CGT CAA TTC 4 800 

Gly Cys Gly Asn Thr Phe Asn Cys Asn His Pro Val Val Arg Gin Phe 
2155 2160 2165 

ATT GTA GAT TGT TTA AGG TAC AGA TAT ACA TTT TAC TTC TAG AAC TAC 4 84 8 

3 0 He Val Asp Cys Leu Arg Tyr Arg Tyr Thr Phe Tyr Phe * Asn Tyr 

2170 2175 2180 

TTT TTC ATT TCT TTT GCT GCT TGT CAT TTT GAT ATG ATT AAT TTG CAA 4 89 6 
Phe Phe He Ser Phe Ala Ala Cys His Phe Asp Met He Asn Leu Gin 
35 2185 2190 2195 2200 

GCT TGT GGG GGT AAA TCT TTT GGT CAG CAT ATT GTA TCT TTA AAT GTC 49 44 

Ala Cys Gly Gly Lys Ser Phe Gly Gin His He Val Ser Leu Asn Val 

2205 2210 2215 

40 

ACA AAT ACT AAT GTC CTG GTG CTT ATT GAT TTG GCA TCT TCA AAT TCT 4 992 

Thr Asn Thr Asn Val Leu Val Leu He Asp Leu Ala Ser Ser Asn Ser 

2220 2225 2230 

4 5 TCT CCA ATG AAA AGG GAA AAA TCT ACT GTA TGT CTC GTC AAC TAA TTT 5040 

Ser Pro Met Lys Arg Glu Lys Ser Thr Val Cys Leu Val Asn * Phe 
2235 2240 2245 

ACT TTT GTT TTG CAG ATA CTG GGT GAT GGA AAT GCA TGT TGA TGG TTT 50 88 
50 Thr Phe Val Leu Gin He Leu Gly Asp Gly Asn Ala Cys * Trp Phe 
2250 2255 2260 

TCG TTT TGA TCT TGC ATC CAT AAT GAC CAG AGG TTC CAG GTA ATT TGT 513 6 
Ser Phe * Ser Cys He His Asn Asp Gin Arg Phe Gin Val He Cys 
55 2265 2270 2275 2280 

ATT TAT TGT TTG TTT GCG TGT TGC CTT TTC AGA AGA TTC TTA AAA GAA 5184 
He Tyr Cys Leu Phe Ala Cys Cys Leu Phe Arg Arg Phe Leu Lys Glu 
2285 2290 2295 



60 



TGT TTC TTT TAC AAG TCT GTG GGA TCC AGT TAA CGT GTA TGG AGC TCC 52 3 2 
Cys Phe Phe Tyr Lys Ser Val Gly Ser Ser * Arg Val Trp Ser Ser 
2300 2305 2310 



WO 99/14314 



PCT/AU98/00743 



- 110 - 



10 



AAT AGA AGG TGA CAT GAT CAC AAC AGG GAC ACC TCT TGT TAC TCC ACC 5280 
Asn Arg Arg * His Asp His Asn Arg Asp Thr Ser Cys Tyr Ser Thr 
2315 2320 2325 

ACT TAT TGA CAT GAT CAG CAA TGA CCC AAT TCT TGG AGG CGT CAA GGT 53 2 8 
Thr Tyr * His Asp Gin Gin * Pro Asn Ser Trp Arg Arg Gin Glv 
2330 2335 2340 

ACT TGT TTC ATC CAA CAC CTG TTG TCT GTG TGC ATT CAA TTG TTT TAA 537 6 
Thr Cys Phe lie Gin His Leu Leu Ser Val Cys He Gin Leu Phe * 
2345 2350 2355 2360 

TAT GGT AAT GAT CAA TTT CCC AAT GTT GAT AAG GAA AAA AAA TGC AAG 5424 
Tyr Gly Asn Asp Gin Phe Pro Asn Val Asp Lys Glu Lys Lys Cys Lys 
15 2365 2370 2375 

TAG CTC TCT TTA TCT GCT TCT TGT GAG TTA TGC TAA ACA TGT AGA TAC 5472 
* Leu Ser Leu Ser Ala Ser Cys Glu Leu Cys * Thr Cys Arg Tyr 
2Q 2380 2385 2390 

TAC TAT ATT TCA ACT GTA TAT ACT TGA CAT ATT ATT GCT TCC TTG GGA 5520 
Tyr Tyr He Ser Thr Val Tyr Thr * His He He Ala Ser Leu Gly 
23 95 2400 2405 

2 5 GGC TCT CTT ATT CCT TTC CCC CGT TGC AAT TAT AGC TCA TTG CTG AAG 5 568 
Gly Ser Leu He Pro Phe Pro Arg Cys Asn Tyr Ser Ser Leu Leu Lvs 



O 2410 2415 2420 



W 30 



40 



CAT GGG ATG CAG GAG GCC TCT ATC AAG TAG GTC AAT TCC CTC ACT GGA 5616 
His Gly Met Gin Glu Ala Ser He Lys * Val Asn Ser Leu Thr Gly 
2425 2430 2435 2440 

ATG TTT GGT CTG AGT GGA ATG GGA AGG TAA GGT ACC TGT TAA AAG TTT 5664 
Met Phe Gly Leu Ser Gly Met Gly Arg * Gly Thr Cys * Lys Phe 
35 2445 2450 2455 

GAA TGG CAA ATA CTG ATA GAA ATA TAA CTT ATA TTT GCG ACA TAT ATA 5712 
Glu Trp Gin He Leu He Glu He * Leu He Phe Ala Thr Tyr He 
2460 2465 2470 

GAT AAA GCA AAA TAA TAC GCA TTC CAC CTG AAC TTT AAA GGG GCA CGC 57 60 
Asp Lys Ala Lys * Tyr Ala Phe His Leu Asn Phe Lys Gly Ala Arg 
2475 2480 2485 

45 AGA ATT ATC CCG CAT CTG TCT ACA AGA ATG ATA ACA CAT GTG CTG AAT 580 8 
Arg He He Pro His Leu Ser Thr Arg Met He Thr His Val Leu Asn 
2490 2495 2500 

AGT GAA GTA CTA CTT CTC AAA TGT CTG AAT GAA CGC ACT AAC TCT TGT 5856 
Ser Glu Val Leu Leu Leu Lys Cys Leu Asn Glu Arg Thr Asn Ser Cys 
2505 2 510 2515 2520 

GAG TGT CAA CCG AGC AAG AAA TAT TTG AGT TTT CTG CAA GAA ATT GTT 5904 
Glu Cys Gin Pro Ser Lys Lys Tyr Leu Ser Phe Leu Gin Glu He Val 
55 2525 2530 2535 



50 



60 



CAT GTT GTG CTG TAT TAT ACT CCC TCC GTC CGA AAT TAT TTG TCG GAG 5 952 
His Val Val Leu Tyr Tyr Thr Pro Ser Val Arg Asn Tyr Leu Ser Glu 
2540 2545 2550 

AAA TGG ATG TAT CTA GAC GTA TTT TAG TTC TAG ATA CAT CCA TTT TTA 6000 
Lys Trp Met Tyr Leu Asp Val Phe * Phe * He His Pro Phe Leu 
2555 2560 2565 
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TCC ATT TCT GCA ACA AGT AGT TCC GGA CGG AGG GAG TAT CAT TTA ACA 6 04 8 
Ser He Ser Ala Thr Ser Ser Ser Gly Arg Arg Glu Tyr His Leu Thr 
2570 2575 2580 

5^ AAT ATA TGC ATG TTC GAA GTA AAT CCC CAC GAA TAA GCA TAT AAG ACG 609 6 
Asn He Cys Met Phe Glu Val Asn Pro His Glu * Ala Tyr Lys Thr 
2585 2590 2595 2600 

ATA TTG CTT TTT GAC TTG CAA CAC CTA AAC CTC ATT GTT TTC TCC TAG 614 4 
10 He Leu Leu Phe Asp Leu Gin His Leu Asn Leu He Val Phe Ser * 

2605 2610 2615 

GAT TTT GGG TGT TCG AAG CAA GCA GCT GGT GAT ATT TAA TTT ACC TTT 619 2 
Asp Phe Gly Cys Ser Lys Gin Ala Ala Gly Asp He * Phe Thr Phe 
15 2620 2625 2630 

GCC TTT ATT TGT AGC TTG ATT TGA GGG TGC GGC AAA GGT TTT AGC TTA 624 0 
Ala Phe He Cys Ser Leu He * Gly Cys Gly Lys Gly Phe Ser Leu 
2635 2640 2645 

20 

GTA GTG TTT TGT AAA TTA TTA TAG TTT ATG TAT ATA CTC CTC ATT TGG 6288 
Val Val Phe Cys Lys Leu Leu * Phe Met Tyr He Leu Leu He Trp 
2650 2655 2660 

2 5 GCA CTT CCG TAC TGG TCC CAT AGA AGA TAA AAA TGG AAT GAT GTC TGG 63 3 6 

Ala Leu Pro Tyr Trp Ser His Arg Arg * Lys Trp Asn Asp Val Trp 
2665 2670 2675 '2680 

CCA ATA ATT GTT GAC AAC ACT GTT GCG CAT TTG ATT TTT ATC AGG GAA 63 84 

3 0 Pro He He Val Asp Asn Thr Val Ala His Leu He Phe He Arg Glu 

2685 2690 2695 

TGG AAA ATT GAA ATC GGT AAG AAA CAT TGC GAT ATT AAG CTT GTA TAT 643 2 
Trp Lys He Glu He Gly Lys Lys His Cys Asp He Lys Leu Val Tyr 
35 2700 2705 2710 

GCT AAT GCT GGT GGA TCT TTA AGA GGG AAC ATA TGA TCT CGT GTG CAT 64 80 
Ala Asn Ala Gly Gly Ser Leu Arg Gly Asn He * Ser Arg Val His 
2715 2720 2725 



40 



60 



CCA TCT TCA ACT AAA AAA ATA TGT TGC ACA TCT CCC ACG TCA CTT ACT 652 8 
Pro Ser Ser Thr Lys Lys He Cys Cys Thr Ser Pro Thr Ser Leu Thr 
2730 2735 2740 



45 AGC TAT TTC ATC CAA GTA CTA ACT TGT GTG GTT GTC TCC TCA GTA CCG 65 7 6 
Ser Tyr Phe He Gin Val Leu Thr Cys Val Val Val Ser Ser Val Pro 
2745 2750 2755 2760 

GGA CAT TGT GCG CCA ATT CAT TAA AGG CAC TGA TGG ATT TGC TGG TGG 6 62 4 
50 Gly His Cys Ala Pro He His * Arg His * Trp He Cys Trp Trp 

2765 2770 2775 

TTT TGC CGA ATG TCT TTG TGG AAG TCC ACA CCT ATA CCA GGT AAG TTG 6 67 2 
Phe Cys Arg Met Ser Leu Trp Lys Ser Thr Pro He Pro Gly Lys Leu 
55 2780 2785 2790 

TGG CAA TAC TTG GAA ATG GGT TGA GTG AAT GTC ACA TGG ATT TTT TAT 67 2 0 
Trp Gin Tyr Leu Glu Met Gly * Val Asn Val Thr Trp He Phe Tyr 
2795. 2800 2805 



ATA TAC CAC ATG ATG ATA CAC ATG TAA ATA TAT AAC GAT TAT AGT GTA 67 6 8 
He Tyr His Met Met He His Met * He Tyr Asn Asp Tyr Ser Val 
2810 2815 2820 
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TGC ATA TGC ATT TGG CTA AGA AGT ACT CCC TCC CTT AGT AAA AGT TAG 6816 
Cys lie Cys He Trp Leu Arg Ser Thr Pro Ser Leu Ser Lvs Ser - 
2S25 2830 2835 - 2 $40 

5 TAC AAA GTT GAG TCA TCT ATT TTG GAA CGG AGG GAG TAT AAG TGT AT* 6864 
Tyr Lys Val Glu Ser Ser He Leu Glu Arg Arg Glu Tyr Lys Cys II- 
2845 2850 2855 

CAC TAG TGC AAT ATA TAG GTT TTA ACA CCC AAC TTG CCA ATG AAG GAA 6912 
10 His * Cys Asn He * Val Leu Thr Pro Asn Leu Pro Met Lys Glu 
2860 2865 2870 

CAT AGG GCT TTC TAG TTA TCT TAT TTA TTT GTC TGG TGA ATA ATC CAC 69 60 
His Arg Ala Phe * Leu Ser Tyr Leu Phe Val Trp * He He His 
lb 2875 2880 2885 

TGA AAA ATT CCA GCC ATG TCA TTT TTT AGG GGG GGA GAA GAA ACT AC- 7 008 
* Lys He Pro Ala Met Ser Phe Phe Arg Gly Gly Glu Glu Thr Th- 
2Q 2890 2895 2900 

TTG ATT TTT CCC CCT AAA AAA AGC CAT CTC AGA TTT CAT AGG TAA CT^ 7056 
Leu He Phe Pro Pro Lys Lys Ser His Leu Arg Phe His Arg * Leu 
2905 2910 2915 * 29 20 

25 GCT TTT CTG TAA AGA AAT GAA AAC GAC TTC ATA CTT TCT GTC GAT TA^ 7104 
Ala Phe Leu * Arg Asn Glu Asn Asp Phe He Leu Ser Val Asp Tyr 
2925 2930 2935 

AAG TGT ATA CAC TAG TGC AAT ATA TAG GTT TTA ACA CCC AAC TTG CCA 7152 
L ^ S Ile His * Cys Asn He * Val Leu Thr Pro Asn Leu Pro 

2940 2945 2950 

ATG AAG GAA CAT AGG GCT TTC TAG TTA TCT TAT TTA TTT GCT GGT GAA 7200 
Met Lys Glu His Arg Ala Phe * Leu Ser Tyr Leu Phe Ala Gly Glu 
35 2955 2960 2965 

TAA TCC ACT GAA AAA TTC CAG CCA TGT CAT TTT TTA GGG GGG AGA AGA 724 8 
* Ser Thr Glu Lys Phe Gin Pro Cys His Phe Leu Gly Gly Arg Arq 
4Q 2970 2975 2980 

AAC TAT ATT GAT TTT TCC CCC TAA AAA AAG CCA TCT CAG ATT CAT AGG 729 6 
Asn Tyr Ile Asp Phe Ser Pro * Lys Lys Pro Ser Gin Ile His Arg 
2985 2990 2995 3000 

45 ..AAC TTG CTT TTC TGT AAA GAA ATG AAA ACG ACT TCA TAC TTT CTG CGG 7344 
Asn Leu Leu Phe Cys Lys Glu Met Lys Thr Thr Ser Tyr Phe Leu Arg 
3005 3010 3015 



50 



CGC TTA CTT AGC TCG ATG GAT ATT TGT AAG ATG AAT GCC AAA TTA TTT 73 92 
Arg Leu Leu Ser Ser Met Asp Ile Cys Lys Met Asn Ala Lys Leu Phe 
3020 3025 3030 

GGC GGG ATT TGA TCG TTA TTC CAA ATT TCA TTT GGT TTC TCT AGC AAT 7440 
Gly Gly He * Ser Leu Phe Gin Ile Ser Phe Gly Phe Ser Ser Asn 
" 3 °35 3040 3045 

CAA CCC AGT ACC TTG TTA TTG GCA CTG CAA TTT CTT ATT GAT TAA TCA 74 88 
Gin Pro Ser Thr Leu Leu Leu Ala Leu Gin Phe Leu He Asp * Ser 
6Q 3050 3055 3060 

GGC AGG AGG AAG GAA ACC TTG GCA CAG TAT CAA CTT GGT ATG TGC ACA 7 53 6 
Gly Arg Arg Lys Glu Thr Leu Ala Gin Tyr Gin Leu Gly Met Cys Th- 
3065 3070 3075 3080 
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TGA TGG ATT TAG ACT GGG TGA TTT GGT ACA TAT AAT ACC AAG TCA ATT 7 584 
* Trp lie Tyr Thr Gly * Phe Gly Thr Tyr Asn Thr Lys Ser He 
3085 3090 3095 

5 TAC CAA ATG GGG AGA CCA ATA GAG ATG GAG AAA ATC ACA ATC TTA GCT 7 63 2 
Tyr Gin Met Gly Arg Pro He Glu Met Glu Lys He Thr He Leu Ala 
3100 3105 3110 

GGA ATT GTG GGG AGG TAA TTC TGA ACT CTC CTT TTT TTT TGA AAT TTT 7 680 
10 Gly He Val Gly Arg * Phe * Thr Leu Leu Phe Phe * Asn Phe 
3115 3120 3125 

CAT GCT TTA CAT AAT AGT CAA ATG GCT GAC AAA TGT CGT TGT ATG GTT 77 2 8 
His Ala Leu His Asn Ser Gin Met Ala Asp Lys Cys Arg Cys Met Val 
15 3130 3135 3140 



20 



CTC TCT ACC TAA ACC GTT AAG GCA GTA AGA GTT TCC CTA CAA GAT CTC 7776 
Leu Ser Thr * Thr Val Lys Ala Val Arg Val Ser Leu Gin Asp Leu 
3145 3150 3155 3160 

TTT GTT CGT ATA ATT GTA TTT TCT AGA GAA AAG TTG CCT TCA ATT TTG 7824 
Phe Val Arg He He Val Phe Ser Arg Glu Lys Leu Pro Ser He Leu 
3165 3170 3175 

2 5 TGC ACG CGG CAG TAC AGG AAT TGT GGT TAT AAA TAT TGA TAC AGG CTG 7 872 

Cys Thr Arg Gin Tyr Arg Asn Cys Gly Tyr Lys Tyr * Tyr Arg Leu 
3180 3185 3190 

ACC ATC GTT ACT AAT AGG GGG AAC AAT AAG CAC ATT TTT TTA ATA GCA 792 0 

3 0 Thr He Val Thr Asn Arg Gly Asn Asn Lys His He Phe Leu He Ala 

3195 3200 3205 

AAG GCA TCA CCC TTG TTC CGT TTC CAA TGA AAT CAC AGT ATC CGA ACC 7968 
Lys Ala Ser Pro Leu Phe Arg Phe Gin * Asn His Ser He Arg Thr 
35 3210 3215 3220 

ATA AGT TTT ACA AGT ATG CGT AGA GAG AAA TAA AGT ATC AAC CCG GCA 8016 

He Ser Phe Thr Ser Met Arg Arg Glu Lys * Ser He Asn Pro Ala 
3225 3230 3235 3240 

40 

GAA ACA GTT GTT TCA GGC GCA AAG AGA AAA GGA AAC GAT ATG CTC TAT 8064 

Glu Thr Val Val Ser Gly Ala Lys Arg Lys Gly Asn Asp Met Leu Tyr 
3245 3250 3255 

4 5 TAC ATC AAC CTT TTA GCA TTT AGG GAC GAC CAG CAT CAT CCC ATC TTC 8112 

Tyr He Asn Leu Leu Ala Phe Arg Asp Asp Gin His His Pro He Phe 
3260 3265 3270 

AAT CAA CTG GAG CGA GGT CAC CTC CAA TCT TCT CAG CAG CCT CAG AGT 8160 
50 Asn Gin Leu Glu Arg Gly His Leu Gin Ser Ser Gin Gin Pro Gin Ser 
3275 3280 3285 

GGT GAC CTC CCA AGC AAG TGC ATC AGC ATC CAT CAT CTG GGG GTT GGG 82 0 8 
Gly Asp Leu Pro Ser Lys Cys He Ser He His His Leu Gly Val Gly 
55 3290 3295 3300 

CAC ATA CCA TGA GCA CAA TCA CCT GAA TTT GAT GAA TTT TCC TCT GTT 82 56 
His He Pro * Ala Gin Ser Pro Glu Phe Asp Glu Phe Ser Ser Val 
3305 3310 3315 3320 



60 



TAC CTT GCA GCA GAC CCC TGC CGT ATA AAT GGT TTT AAA TGA CAG CAT 83 04 
Tyr Leu Ala Ala Asp Pro Cys Arg He Asn Gly Phe Lys * Gin His 
3325 3330 3335 
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GTT CTT TCA GTT TGA GCA AAA TTT GTG CAA TTG CAA AG A AGC TTT AGA 8 3 52 

Val Leu Ser Val * Ala Lys Phe Val Gin Leu Gin Arg Ser Phe Arg 
3340 3345 3350 

5 ATC ATG TGG AAC ATG CAC TTA CAT TTC ATC TGA CAA TAT AGG AAG GAG 84 00 
He Men Trp Asn Met His Leu His Phe He * Gin Tyr Arg Lys Glu 
3355 336O 3365 

AGC CCG ACG TCG CAT GCT CCT CTA GAC TCG AGG AAT TCG CAA GAT TGT 84 4 8 
10 Ser Pro Thr Ser His Ala Pro Leu Asp Ser Arg Asn Ser Gin Asp Cys 
3370 3375 33 8 o 

CTG TCA AAA GAT TGA GGA AGA GGC AGA TGC GCA ATT TCT TTG TTT GTC 84 9 6 
Leu Ser Lys Asp * Gly Arg Gly Arg Cys Ala He Ser Leu Phe Val 
15 3385 3390 3395 3400 



20 



40 



50 



60 



TCA TGG TTT CTC AAG TAA GAC TTA TAT CTG ATC TCT TCA ATT TTT GAG 8 54 4 
Ser Trp Phe Leu Lys * Asp Leu Tyr Leu He Ser Ser He Phe Glu 
3405 3410 3415 

ATT GCC TGT TTT TCA CAA TGG CAT ATG TTG TCA GGT GAA ACA TCC AAT 8 592 
He Ala Cys Phe Ser Gin Trp His Met Leu Ser Gly Glu Thr Ser Asn 
3420 3425 3430 



2 5 CCC AGT ATT AAT AGA GCC AAC ATG AAG GGA TTG CTT ATC TGA GAT ATC 
Pro Ser He Asn Arg Ala Asn Met Lys Gly Leu Leu He * Asp He 
3435 3440 3445 



45 CTA AGT AAC TAT CTT CAA ATC TTT GCA TTC ATC CGT CAT GGC TCT TCT 
Leu Ser Asn Tyr Leu Gin He Phe Ala Phe He Arg His Gly Ser Ser 
3515 3520 3525 



8640 



TGC CAA AGT TGA ATT CTT AGA TTC ACC TTC TTC AGT ATT TCA GAC CTT 8688 
3 0 Cys Gin Ser * He Leu Arg Phe Thr Phe Phe Ser He Ser Asp Leu 
3450 3455 3460 

CTA AGC ATT TTC ATT TTT TTT TTC AAT TGT TAG GGA GTT CCA ATG TTT 8736 
Leu Ser He Phe He Phe Phe Phe Asn Cys * Gly Val Pro Met Phe 
35 3465 3470 3475 3480 

TAC ATG GGC GAT GAA TAT GGC CAC ACA AAA GGG GGC AAC AAC AAT ACA 8784 
Tyr Met Gly Asp Glu Tyr Gly His Thr Lys Gly Gly Asn Asn Asn Thr 
3485 3490 3495 



TAC TGC CAT GAT TCT TAT GTC AGT ACA ATT TGG TCA CAT ATT GTT GTT 883 2 
Tyr Cys His Asp Ser Tyr Val Ser Thr He Trp Ser His He Val Val 
3500 3505 3510 



8880 



GTA GGT CAA TTA TTT TCG CTG GGA TAA AAA AGA ACA ATA CTC TGA CTT 8928 
Val Gly Gin Leu Phe Ser Leu Gly * Lys Arg Thr He Leu * Leu 
3530 3535 3540 



GCA AAG ATT CTG CTG CCT CAT GAC CAA ATT CCG CAA GTA AGT ATT CCG 897 6 
Ala Lys He Leu Leu Pro His Asp Gin He Pro Gin Val Ser He Pro 
55 3545 3550 3555 3560 



TTG AAT AAT TTC TGT GTA GAA CCA CTG AAG GTG CCT CCA AAC GCT AAG 902 4 

Leu Asn Asn Phe Cys Val Glu Pro Leu Lys Val Pro Pro Asn Ala Lys 
3565 3570 3575 

CGA GCA AGG TCA ATT TCA CAC CCT AAT CAA GTT GGT GTT GTC TAT TTG 9 07 2 

Arg Ala Arg Ser He Ser His Pro Asn Gin Val Gly Val Val Tyr Leu 

3580 3585 3590 
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TGT ATT TGA TCT GCT GCA CTG TAG GGA GTG CGA GGG TCT TGG CCT TGA 912 0 

Cys lie * Ser Ala Ala Leu * Gly Val Arg Gly Ser Trp Pro * 
3595 3600 3605 

5 GGA CTT TCC AAC GGC CGA ACG GCT GCA GTG GCA TGG TCA TCA GCC TGG 916 8 
Gly Leu Ser Asn Gly Arg Thr Ala Ala Val Ala Trp Ser Ser Ala Trp 
3610 3615 3620 

GAA GCC TGA TTG GTC TGA GAA TAG CCG ATT CGT TGC CTT TTC CAT GGT 9 216 
10 Glu Ala * Leu Val * Glu * Pro lie Arg Cys Leu Phe His Gly 
3625 3630 3635 3640 

ACA CAT ATA GTT CTG ACA CTT CAC TAT AGT TGT TTT AAA AAA GAA AAT 926 4 
Thr His lie Val Leu Thr Leu His Tyr Ser Cys Phe Lys Lys Glu Asn 
15 3645 3650 3655 

TTA ACT CAA AAG TAA ATT ATG GAG A 9 2 89 

Leu Thr Gin Lys * He Met Glu 
3660 



